Gene Haur_4564 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4564 
Symbol 
ID5736409 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5841707 
End bp5842744 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content48% 
IMG OID641281726 
Productbinding-protein-dependent transport systems inner membrane component 
Protein accessionYP_001547323 
Protein GI159901076 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1175] ABC-type sugar transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCATCGG CAGAAAGTAG CGTCGAGCGT GCTAGTGATA GCGCCATCCC ACGACCTTCG 
TGGTGGGCAC GCACCAGAAC CTCGCGAACA GCCTATACGT ATCTTTTTCC CGCTCTGATC
GTGATGTCGA TCATCACGTT CTATCCCATT CTCTACCAAT TTTGGATGTC GCTAACCGAC
TTTGGCCCAT CCAGCATCAA CCCTCTTGCC AAAAACTATA CGCCACCCAA ATATGTGGGC
TTTGAAAACT ATCAATTAAT TTTGCAAGAT AAGTTGGCTA CCAAAAATGC CGACTTAGCC
AGCTTCAAAT TCTGGCGCAC CTTGGGCTTC AACATTTGGT GGACATTCTC GAATGTGATT
TTTCACGTTT CACTGGGGAT CGTCATCGCC GTGATGTTGA ACGTCGAGGG TTTGTGGTTT
AAAAAGATCT ATCGCGCAAT TTATATCTTA CCCATGGTGC TGCCACAGTT GGTTATTGCC
ACGATTTGGC GCAATATGTT CGATGGGCAA TATGGCGCGA TCAATTTGAT GTTGAAGATC
TTTCTTGGCC CAGCCTTCCC CAGCGGTGGG ATTGATTGGC TCCAGCGGAT TGAGCCAGTC
GCCTTTGGCT TACCACTTTC GTACTTTGCC ATGTTGATCG CCAATATTTG GCTGGGCTGG
CCGTTTATGA CGATTGTGGC CACTGGTGCA CTCCAAAGTA TCCCCAAAGA ATTGTATGAG
GCGGCCTCGA TCGATGGCGC AACTGGCTGG AATAAATTCT GGACAATTAC CCTGCCATTG
CTCCGCCCCG CCATGGTTCC GGCAACCATT TTGGGGATTA TCCAAACCTT CAATCTGTTC
CATGTGATTT ATTTTATCAG CGGCGGCGGC CCACTCGGCC AAACCGAAAT CTTGGTGACC
CAAGCCTATA AGCTGATCAA TAATAACTCC TTGTTTGGGA TTGGGGCCGC ATTTAGCGTC
TTTATCTTTA TTATTCTTGG CTGTATCTCG GCAATTACCG CCAGAATTTC ACGAGTAGCG
GAGTCATACG ATGGCTAA
 
Protein sequence
MASAESSVER ASDSAIPRPS WWARTRTSRT AYTYLFPALI VMSIITFYPI LYQFWMSLTD 
FGPSSINPLA KNYTPPKYVG FENYQLILQD KLATKNADLA SFKFWRTLGF NIWWTFSNVI
FHVSLGIVIA VMLNVEGLWF KKIYRAIYIL PMVLPQLVIA TIWRNMFDGQ YGAINLMLKI
FLGPAFPSGG IDWLQRIEPV AFGLPLSYFA MLIANIWLGW PFMTIVATGA LQSIPKELYE
AASIDGATGW NKFWTITLPL LRPAMVPATI LGIIQTFNLF HVIYFISGGG PLGQTEILVT
QAYKLINNNS LFGIGAAFSV FIFIILGCIS AITARISRVA ESYDG