Gene Rcas_4421 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_4421 
Symbol 
ID5541934 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5682248 
End bp5683552 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content62% 
IMG OID640896519 
Productmajor facilitator transporter 
Protein accessionYP_001434455 
Protein GI156744326 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0230065 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.999715 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACCACA TTATCGAACG CTGGCGCGCA CTGAAAACCG ACGCGCGTCG CTATCTTCTG 
CACCTGGCGA TCCTTACGCT CAGCCTTTCG GTGGTGGGGC TGTACTACAA TCTGGCGGTG
CTGGCGCTCG GTTATTCCCG TGGCTTCCTT GGCATCCTGC AAACAGTCAC ACAGGCGGTT
GCGGCGCTGT TGAGCTTGCC GCTCTGGTTG GGCGTGCAGC GCCTGGGATT GCGCGCTGCG
ATCATAACGT CGGTGGCGTT GTATGCCGCC GGCATTCTGT TGTTCGCCGC TTTTCCGCAG
GCGGCGCTGC TCATCTGTTC TGCCGCTCTG ATGGGAATGG CGTCCGTGCT GATGCAGGTG
ACTGCTGCAC CTCTCATGAT GCGCCTCAGC AACGACGCCA CTCGCGATGA TCTGTTCAGC
GCCAGCGCCG CAGTCGCCGT TGGGTTGAGC GGCATCGGCA ACCTGGGCGC AGGGTTCATC
GCCGATACGC TGAGTCACGT GCTTGGCGCT CCCGCCGACA GTGGCGCGAC CTATCGCGCC
GTTTTTGCGA TCGCCGGGAT TGGGGTACTG GCATCGCTTC CACCGTTGTT CCTGATTCGT
GAACCGGAAC GAGCGCCAGA GTCACCCTTG CCTGAAAGCG GCGCCTCACC TTCGGGTACA
GGATCGTTCT TCGGTGGTCT TCCCTCCTGG CTGTTGGTTG CCCAACCATG GTTCTGGCGT
CTTCCGACGC CACTACGCGC TTTGTTGCGC AAGCCAGCGT TTGTGGCGAA ACTGCTGCTG
CCGCCGTTCC TGATCTCGTG GGGCGCGGCG TTGCTCATCC CTTACCTCAA CCTCTACTTC
CGCGAACGGT TTGGCCTCGA CAATCGCGCG CTCGGCGTAC TCTTCGCCGG ATTCGATGTG
GCGACCGGTC TGGCGTCATT GACCGGTCCG GCGCTGGCGG CGCGGCTGGG GAAGATGCGC
ACCGTGGCGG TGACCCGCGC TATCGGCGTG CCGTTGCTGC TGATCCTGGG AGCGGCGTCG
GATCTGTGGC TCGCAATTGC GGCGGCGCTG GCGCGGGTGA TGGCATCCAA TATGGCATCG
CCGCTCTACG ACGCCTATGC GATGGAACAG ACTGAGGAAA GTGCGCGTCC GTTCATGATC
GGTCTGCTGG GAGCAGCGTA CAGCATCGGG TTTCTCATTG CTCCCTTGAT CAGCACTTTT
GTTCAGGAAC GGTATGGCTT TGGTCCGCTG TTCGTCGCAA CAGCGATCCT GTATGCACTC
TCGGTTGCTG TCACATGGTG GTTTTTTGGG AAAGAGCGCC CTTAA
 
Protein sequence
MHHIIERWRA LKTDARRYLL HLAILTLSLS VVGLYYNLAV LALGYSRGFL GILQTVTQAV 
AALLSLPLWL GVQRLGLRAA IITSVALYAA GILLFAAFPQ AALLICSAAL MGMASVLMQV
TAAPLMMRLS NDATRDDLFS ASAAVAVGLS GIGNLGAGFI ADTLSHVLGA PADSGATYRA
VFAIAGIGVL ASLPPLFLIR EPERAPESPL PESGASPSGT GSFFGGLPSW LLVAQPWFWR
LPTPLRALLR KPAFVAKLLL PPFLISWGAA LLIPYLNLYF RERFGLDNRA LGVLFAGFDV
ATGLASLTGP ALAARLGKMR TVAVTRAIGV PLLLILGAAS DLWLAIAAAL ARVMASNMAS
PLYDAYAMEQ TEESARPFMI GLLGAAYSIG FLIAPLISTF VQERYGFGPL FVATAILYAL
SVAVTWWFFG KERP