Gene Haur_0221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0221 
Symbol 
ID5732116 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp257239 
End bp258852 
Gene Length1614 bp 
Protein Length537 aa 
Translation table11 
GC content50% 
IMG OID641277345 
Productperiplasmic component of the Tol biopolymer transport system-like 
Protein accessionYP_001543001 
Protein GI159896754 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0823] Periplasmic component of the Tol biopolymer transport system 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTCAAT CGTGTGTCGT TTGTCGAGCC GCGCTTGCCG ATGGCTCGGT CTATTGTTCT 
GAATGTGGCT CGCGGCAGCC CGCCGCAGGT CAATCAACCC AAGTGTTGGG CAGCGCGGTT
TTCGGTTCGC AGCCCTTACC AAGCAGCAGC GATCATGATG ATTCGCCGTA TGCGCCACGG
CGGAGCGGCA CGCAACCGCT AAATGATCCA CCAACCCAAG TCGTCAACAA CTACCAAAGC
CCTAGCTCAA CCAGCAATGT TGGTGGCTTT GGGCAAGGGC CATCAACCCA ACCCAATTTT
CCGTTACCTC AATATACCCC GCCGACCCCA CCCAACTATG CCCAACCTAC TGCAAAATCG
AAGAACCTTG GGCATTGGTT GATTGGTGGT GGCTTGGCCT TGCTGTTGAT TGGCGGCGGC
GCAGGGGCTT ATTATTTCCT CGGCAACAAC GATTCAAACA ATGGCGAAAG TGGCAATGTT
GCGACTGGCC CAACCTGGAC TCCGATTCCA ACCAAAATCG CCGAGCCAAC CGAACAAGCA
GCTGGCGATC CACTGGTTGA GCCAACCAGC GAGCCAACTG TTGAGCCAAC TCAAGGCCAA
AGTGATCCGG CCGATCAACC AACGCCAACC AATGAATCCA ACGATTTGCC AACTCAAGCG
GCTGGTGGTA GCGCCCCAAC TGATCTCACT GGCGAGCTGA TTTATCTTGA TGATAGCTTT
GAATTGGTGC GCCAAACCAT GAGCACTGGC TCAGTCAGCC CGCTTGATCT TGGCGGACAA
GCCTATTACA ACGATTTACT GAGTTGGTCG CCTGATGGCA AAACTATGGC CTTTTTTGTA
CGCGACGGCG CAAAAACCAA CATTTATTTA GCCGATGGTG ATGGCAGTAA TAGCCGCAGC
GTGATTGAAT TGCAGAATAT GGCTCCCCAA AGCTTGAGCT GGTCGCCTGA TAGCAGCAAA
TTTGCCTTTA TCACCAGCGA TATCGATTTT GAAACCAAGG AAGATCAAAA TCTATATGTT
TATGATCTGG CCAGTAATAA CGAAAAACAA CTGACAACCA CCGGCTTGAT CGATTTTGAT
CCGTTGAGTT GGTCGCCTGA TAATCAAACG ATTTTGTTTG CTGCTGGTCA GGATGGTATC
GAATTCAATG TGATTAATGC TGATGGCAGC AATCTCGCCA AACTTGCCGA TGTGTTTACT
TCTGATGCAT GGTGGACCAA GGATAGCAAA ATCATCTACG ACGATTTTTG CGATCGTTCA
AACTTTGATC GTGGGGTTTG TTTGCTCGAC CCAGCCACGG GCGAGGTTGA AACATTGCAA
AAAATTGGCG ATTTTTATCT GAGCGGAATT TCGCCCGATG CCAATTGGTA TATGCTCGAT
AATTATAACG ATGGCTCGTT GCTGCTGATC AATGCCGATA CTGGCGATAA AGAATTAGTT
GCGCCACCAA GCAGCGGCGG TTCCTACGAA GTTCGGGTGT GGGGCAAGTG GTCACCCGAT
GGACGTTATG TCACCTACGA AACGATTGGC CAAGGCACGT TTATCTATGA AATTGGCTCG
GGCCAAGCCG CCGCACCTTT CGTCAATGGC TCAATTATGG AGTGGCTGCC ATAA
 
Protein sequence
MAQSCVVCRA ALADGSVYCS ECGSRQPAAG QSTQVLGSAV FGSQPLPSSS DHDDSPYAPR 
RSGTQPLNDP PTQVVNNYQS PSSTSNVGGF GQGPSTQPNF PLPQYTPPTP PNYAQPTAKS
KNLGHWLIGG GLALLLIGGG AGAYYFLGNN DSNNGESGNV ATGPTWTPIP TKIAEPTEQA
AGDPLVEPTS EPTVEPTQGQ SDPADQPTPT NESNDLPTQA AGGSAPTDLT GELIYLDDSF
ELVRQTMSTG SVSPLDLGGQ AYYNDLLSWS PDGKTMAFFV RDGAKTNIYL ADGDGSNSRS
VIELQNMAPQ SLSWSPDSSK FAFITSDIDF ETKEDQNLYV YDLASNNEKQ LTTTGLIDFD
PLSWSPDNQT ILFAAGQDGI EFNVINADGS NLAKLADVFT SDAWWTKDSK IIYDDFCDRS
NFDRGVCLLD PATGEVETLQ KIGDFYLSGI SPDANWYMLD NYNDGSLLLI NADTGDKELV
APPSSGGSYE VRVWGKWSPD GRYVTYETIG QGTFIYEIGS GQAAAPFVNG SIMEWLP