Gene Haur_4272 details

Gene Information

Locus tag: Haur_4272
Symbol:
ID: 5736131
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5453473
End bp: 5454909
Gene Length: 1437 bp
Protein Length: 478 aa
Translation table: 11
GC content: 47%
IMG OID: 641281432
Product: membrane protein-like protein
Protein accession: YP_001547032
Protein GI: 159900785
COG category: [S] Function unknown
COG ID: [COG3463] Predicted membrane protein
TIGRFAM ID:
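
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function name `cds_lengths` is hypothetical and not part of the source database.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumption: coordinates are 1-based and inclusive, and the last codon
    is a stop codon that is not counted in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per amino acid, minus the terminal stop codon.
    return gene_len, gene_len // 3 - 1


gene_len, protein_len = cds_lengths(5453473, 5454909)
print(gene_len, protein_len)  # 1437 478
```

Under these assumptions the result agrees with the listed gene length (1437 bp) and protein length (478 aa).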


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAACGAT TTTTTGCAAC GATCGATACA TATGCCAAAG CGCTGCTTAC AACAATGATT 
GTGGGCTATA TCGGGGTATT CGCTACACTG TCGTGTCTAA AATTGGCATG GTTTCGCCAA
GGCTTCGATA TGGCGGGCAA CGAGCAAACG ATTTGGAATA CCTTGCATGG GCGGCCATTC
CAAATTTCGG TGTTTGCCAT GATGCGCTAC GATTTTGATG ATGGCCCTGT GTTATTGCAA
TTGCCCTTAG CATTATTGTA TGGCATATAT CAATCGCCCT ATACGCTGCT TGTGTTGCAA
ACAATTGCTT TAGGCATTGC TGCTTGGCCA TTGTATGCGA TTGTGCGTGA TCTTTTGCCA
AAGCCATGGC ATGCCTTGGT GATTGCTGCC ATTTATTTAT TACACCCCAC AACTCAGCAT
ATCAATATGT ACGAGTTTCA ATTGCGCTCA TTTATGATTC CATTTGCCTT GGCAGCATTA
TTATATTTGC GCCGTGAACG CTTGGGATTG TATTGTTTAT TTCTATTTTT GATGATGTGC
ACCAAAACTG AGGCAGGTTT TACGTTAATT GCCTTTGGTT TATATGCAGC TTGGCAGCGC
AAGCCTTGGA AATTTATTGC ATTTCCTTTG GTGCTAGGGC CAGCCTGGGT TGCGGTGGCT
TTGGGCGTAA TTGTTCCAGC CTTCAGCGAG GGTAATTTTA TTGCTGATAT TTACAGTTAT
GGCAGGCTTG GCAAAACTGT TGGCGATGTG ATCACAACTA TGCTGACCAA TCCAGCCCTT
GCTTTTAGCG TCATGACCGA GCCGCCCAAA CTCAAATATT TGTGGCAATT GTTTGGGCTT
GGCGGCTTTT TGGCCTTGCT CAGCCCAACC TTGCTGTTGG CATTACCAGT TTTAGCGCTC
AACTTAATCT CGCCCAATGC AGTCCAATTC AGCCTAAATT ATCAATATGG CTCGTTAGTT
TATCCATTTT TGCTCGTGGC CTCGGTCGAA GGTTTGCTGA ATCTCACCCG CTGGACAGTG
CGCAACTCGC AATGGCGCGA ACGGGCAGTG CATGGCGCGG TGCTGGTTTT ATTGCTGATT
GGGATTATTG GTAATTTGAC CTTGAATAAT GTGGTAAAAA CCGCTTTGAG CAACCGCGAA
AATCCAACAC GGGTAGCCGA TGCCCGGGCA ATTTTGGCTC AAGTGCCTGC CGATGCCGCT
GTTGCCGCCA GCACCTTTCT GGCACCACAC CTCGCCCAGC GCCAAGAAAT CTACTTCTTT
CCAGGCAATA AATCGTATCC CGCTGAATAT ATTGAGCGAG CCGAGTATTT GGTGTTTGAT
CGGCGGCCAC CAGGCAATAG CGCCGAAACT CGCGCTGCAA TCGAGCGCTA TTTGAACGAT
CCTGACTGGG TGATTGTGGC CGAGGCTGGC GATTTTGCCT TATTAAAGCA GCAGTAA
 
Protein sequence
MQRFFATIDT YAKALLTTMI VGYIGVFATL SCLKLAWFRQ GFDMAGNEQT IWNTLHGRPF 
QISVFAMMRY DFDDGPVLLQ LPLALLYGIY QSPYTLLVLQ TIALGIAAWP LYAIVRDLLP
KPWHALVIAA IYLLHPTTQH INMYEFQLRS FMIPFALAAL LYLRRERLGL YCLFLFLMMC
TKTEAGFTLI AFGLYAAWQR KPWKFIAFPL VLGPAWVAVA LGVIVPAFSE GNFIADIYSY
GRLGKTVGDV ITTMLTNPAL AFSVMTEPPK LKYLWQLFGL GGFLALLSPT LLLALPVLAL
NLISPNAVQF SLNYQYGSLV YPFLLVASVE GLLNLTRWTV RNSQWRERAV HGAVLVLLLI
GIIGNLTLNN VVKTALSNRE NPTRVADARA ILAQVPADAA VAASTFLAPH LAQRQEIYFF
PGNKSYPAEY IERAEYLVFD RRPPGNSAET RAAIERYLND PDWVIVAEAG DFALLKQQ
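
The gene and protein sequences above are related by translation table 11. As a rough sketch of how that relationship could be checked, the snippet below builds the codon table from the standard base ordering and translates a DNA string up to the first stop codon. Table 11 uses the same amino acid assignments as the standard code and differs only in its permitted start codons; this sketch does not handle alternative start codons, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Standard codon ordering (T, C, A, G) with the matching amino acid string.
# Translation table 11 shares these assignments; only start codons differ.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGCAACGAT TTTTTGCAAC GATCGATACA"))  # MQRFFATIDT
```

Passing the full gene sequence to `translate` and comparing the result with the listed protein sequence (after removing the spaces and line breaks) would give an end-to-end consistency check.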