Gene Haur_3053 details


Gene Information

Locus tag: Haur_3053
Symbol:
ID: 5734925
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 3857096
End bp: 3858433
Gene Length: 1338 bp
Protein Length: 445 aa
Translation table: 11
GC content: 53%
IMG OID: 641280197
Product: inner-membrane translocator
Protein accession: YP_001545819
Protein GI: 159899572
COG category: [R] General function prediction only
COG ID: [COG1079] Uncharacterized ABC-type transport system, permease component
TIGRFAM ID:
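
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions, and that the stop codon is counted in the gene length but not in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information listing.
START_BP = 3857096
END_BP = 3858433
GENE_LENGTH_BP = 1338
PROTEIN_LENGTH_AA = 445


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming the final codon is a stop codon."""
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1338
print(expected_protein_length(GENE_LENGTH_BP))  # 445
```

Under these assumptions both values agree with the listed gene length and protein length.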


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAATTTA CACGCGCACG CGGGGTTGGC CTGCTCGGGA TTTTCATCGC GATTGTCCTA 
ATTTTTGCAG TCAATAACAG CCTCGGCAAT AAAATCACTG AAAAAAGCAA AATCCAATCC
ATTTGGGATT TTACCAGCAC GATTCGGATT CCCGATAGTG CCAACATGAA GGTAGATCCC
AAAGCTCCAC CAGTCGCCGC GATCGAACTG CCCACCATGA TCACGATTAG CTTGATCTGT
GGAGCTTTAG TGATCGCAGG TGGCACGCTG ATCGCCAAGA ATCCTGAACG TGGTTGGGTT
GCTCCCTATA TGAGCAGCGT GATTGCCCTT GGTTTCTTTT CGATCTTGCT CTGGGCAGTC
GCCAACAACA GCGTTGACCT CTCGGATACG CTTTCGCGGG TGGTACGGCT TGGCACGCCA
ATCGCAATTG GAGCGTTGGC AGGGATTGTC TGTGAACGCT CAGGTGTGGT CAACATCGGG
ATCGAAGGCA TGATGCTGAC GGCAGCCTGT TTTGGCTATG CTGCTGCCGC GATCTCAGCC
CGCGCAATTT ATGGCGATGC CTTACCCGAA AGCGCCGTCA GAGGCGTGGT TTGGATGCCC
TTGATCATCG GGATTTTGGG CGCAATCGTG ACGGGCGGCA TGATGGCGGC CTTGCATGCG
TGGCTCTCAA TTCGCTTCAA AGTTGACCAA GTGATCAGCG GTACGGTGAT CAACATTATG
GCAGTTGGTA TTACCGGCTT TACCCGCACC AACTTTTTGC TCAAATTTGA ATCGCCACTC
CGCACGGGCT TGCCACAAAT GCCGCTTGGG CCATTGGCCG ATATCCCATT GCTTGGGCCA
ATTTTGTTTG ATCACAAACC AATCACCTAT TTGATGATTC TGATGGTGTT TGGCTTAAAC
TTCTTCCTGT TTCGCACAGT GTGGGGCTTG CGTACCCGCG CGATCGGCGA ACACCCCAAA
GCTGCCGATA CCGTCGGGAT CAACGTCAAT CGCATGCGCT ATCGCAACGT GATTATTGGC
GGGATGATCG CTGGGGTGGC TGGGGCTTGG TTCTCACTGG AAGGCTCGTT TGGCTTTGAT
GATGGTATGA CCAGCGGTCA AGGCTTTATC TCATTGGCCG CGATGATCTT CGGCAAGTGG
AATCCAATTG GCGCTTTTGG TGGGTCGTTG CTCTTCTCAT CTGCCGATGC CTTGCAGCTC
AAAGTGCAGG CTTATAGTTT CGATTTGCCC TCGCAATTTA TGCAGATGTT GCCGTATGTG
GTAACCCTGA TTGTGTTGGC TGGGGTGATT GGCCGCGCCC GACCACCAGC CGCCTCAGGC
AAAGTGTACG AAAAGTAA
 
Protein sequence
MKFTRARGVG LLGIFIAIVL IFAVNNSLGN KITEKSKIQS IWDFTSTIRI PDSANMKVDP 
KAPPVAAIEL PTMITISLIC GALVIAGGTL IAKNPERGWV APYMSSVIAL GFFSILLWAV
ANNSVDLSDT LSRVVRLGTP IAIGALAGIV CERSGVVNIG IEGMMLTAAC FGYAAAAISA
RAIYGDALPE SAVRGVVWMP LIIGILGAIV TGGMMAALHA WLSIRFKVDQ VISGTVINIM
AVGITGFTRT NFLLKFESPL RTGLPQMPLG PLADIPLLGP ILFDHKPITY LMILMVFGLN
FFLFRTVWGL RTRAIGEHPK AADTVGINVN RMRYRNVIIG GMIAGVAGAW FSLEGSFGFD
DGMTSGQGFI SLAAMIFGKW NPIGAFGGSL LFSSADALQL KVQAYSFDLP SQFMQMLPYV
VTLIVLAGVI GRARPPAASG KVYEK
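
The sequences above are printed in space-separated blocks of 10 characters. As a minimal sketch of how the gene sequence relates to the protein sequence and the GC content field, the Python below strips the block spacing, computes the G+C fraction, and translates codons until the first stop codon. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in their permitted start codons). The helper names `clean`, `gc_content`, and `translate` are hypothetical and chosen only for this illustration.

```python
# Codon table built in the conventional TCAG ordering used for genetic code listings.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and normalize to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty sequence)."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence, copied from the listing.
first_row = "ATGAAATTTA CACGCGCACG CGGGGTTGGC CTGCTCGGGA TTTTCATCGC GATTGTCCTA"
print(translate(first_row))  # MKFTRARGVGLLGIFIAIVL
```

The printed residues match the first two blocks of the protein sequence. Passing the full gene sequence to `gc_content` and `translate` would allow the GC content and protein fields to be checked the same way.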