Gene Haur_3051 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3051 
Symbol 
ID5734923 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3854194 
End bp3855729 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content52% 
IMG OID641280195 
ProductABC transporter related 
Protein accessionYP_001545817 
Protein GI159899570 
COG category[R] General function prediction only 
COG ID[COG3845] ABC-type uncharacterized transport systems, ATPase components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACAGCGA ACGTCGTACC TGCGTTGGAA GTCCGCAATA TCACCAAGCA ATTCCCCGGG 
GTTTTGGCCA ACGACCAGAT CTCTTTTACC CTACTCCCCG GCGAAATCCA CGCCTTCCTT
GGTGAAAATG GCGCGGGTAA ATCCACCCTC ATGAACATCT TGTACGGCTT GTATGACCCC
GATCAAGGCG AGATTTTGAT CAACGGCGAA TTGGCCAAGA TCAAAAATCC GGCTGATGCG
ATTCGGCTTG GTTTAGGCAT GGTGCACCAA GAATATGCCC TCGTTGATAC CCTAACCGTC
ACCGAGAACA TTATTTTGGG CACCGAATTG CGCAATGGCC CAATTCTCAA TGCCAAAGAA
GCCAGTCAAC GGGTGCGCGA GCTTTCCGAG CAATATGGCT TGGCCGTGCC ACCCGATGCC
ATCGTCGGCG ATTTACCTGT CGGGGTGCAG CAACGGGTCG AAATTTTGAA GGCGCTCTAT
CGCAATGTCA ATATGTTGAT TCTCGATGAA CCAACTGCCG TATTGACTCC CCAAGAGGCC
GATGACCTCT TTCGGGTTAT GCGTGAATTA GCCGATCAGG GCAAGGCGCT GATTTTTATT
ACGCACAAAT TACGCGAAGT GCGCGAAGTT GCCGACCGAA TTACGGTGTT GCGCCGTGGT
AAGGTTGTCG GCGGCGCAGA CCCCAAAACT GCCAGCGAAG GCCAATTAGC CGCGCTGATG
GTTGGTCGCG ATGTTGTCTT GAAAGTAGAT AAAACTGCGG CCAATCCTGG CGATGCGGTG
CTCAAAGTTC GTAATCTTAA ACTGCGCGAT GATCGCGGAT TAACTGTGCT TGATGATGTC
TCGCTTGATG TCCATGCTGG CGAAGTGTTG GGGATTGCCG GGGTGCAAGG CAACGGCCAA
ACTGAATTGG TCGAGGCGAT TACTGGCCTG CAACATCTGC CAACCGGCTC GATTGAGTTG
CTCGGCAAAG ATATTACCAA TCGCCCGCCA CGATCGATCA CTGAACAAGG GGTCGCCTAC
ATTCCCGAAG ATCGCAAAGG CACAGGCTTA GTGCTTGGCT ATAGCATCCG CGATAATTTG
GTGCTTTCAA CCTATTACAA AGCGCCATTT ACCAAAGGCT TGACCTTCAA CGATGCCGCA
ATCGACGAAA ATGCTAAAGA ATTGATCGAA ACCTTTGATG TGCGCACACC CAGCGCCACC
ACCACGGCTG GCTCGCTCTC AGGTGGTAAT CAACAAAAAA TTATTGTAGC ACGTGAATTT
ACCCGCGAAA ACCGCTTGCT GATTGCTTCG CAACCAACGC GGGGGATTGA CGTTGGCTCG
ATTGAATTTA TTCACAATCA AATTATTAAA CAGCGCGATG CAGGGCTAGC GGTCTTATTG
GTTTCGGCAG AGCTTGATGA AATTCTGTCC TTAGCCGACC GCGTTGCTGT GATGTACCAC
GGCAAAATTG TTGGCATTCT CCCGATTGCC GAAGCTACGC GCGAGCGCGT TGGCCTGTTG
ATGGCCGGTA TTCAGGAGCA ACCCAATGGC AAATGA
 
Protein sequence
MTANVVPALE VRNITKQFPG VLANDQISFT LLPGEIHAFL GENGAGKSTL MNILYGLYDP 
DQGEILINGE LAKIKNPADA IRLGLGMVHQ EYALVDTLTV TENIILGTEL RNGPILNAKE
ASQRVRELSE QYGLAVPPDA IVGDLPVGVQ QRVEILKALY RNVNMLILDE PTAVLTPQEA
DDLFRVMREL ADQGKALIFI THKLREVREV ADRITVLRRG KVVGGADPKT ASEGQLAALM
VGRDVVLKVD KTAANPGDAV LKVRNLKLRD DRGLTVLDDV SLDVHAGEVL GIAGVQGNGQ
TELVEAITGL QHLPTGSIEL LGKDITNRPP RSITEQGVAY IPEDRKGTGL VLGYSIRDNL
VLSTYYKAPF TKGLTFNDAA IDENAKELIE TFDVRTPSAT TTAGSLSGGN QQKIIVAREF
TRENRLLIAS QPTRGIDVGS IEFIHNQIIK QRDAGLAVLL VSAELDEILS LADRVAVMYH
GKIVGILPIA EATRERVGLL MAGIQEQPNG K