Gene Haur_3339 details


Gene Information

Locus tag: Haur_3339
Symbol:
ID: 5735209
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 4208713
End bp: 4210575
Gene Length: 1863 bp
Protein Length: 620 aa
Translation table: 11
GC content: 54%
IMG OID: 641280486
Product: hypothetical protein
Protein accession: YP_001546103
Protein GI: 159899856
COG category: [S] Function unknown
COG ID: [COG4412] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
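
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information record.
START_BP = 4208713
END_BP = 4210575
GENE_LENGTH_BP = 1863
PROTEIN_LENGTH_AA = 620


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Amino-acid count, assuming exactly one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1863, matches the Gene Length field
print(coding_codons(GENE_LENGTH_BP))   # 620, matches the Protein Length field
```

Under these assumptions the three fields agree: 4210575 - 4208713 + 1 = 1863 bp, and 1863 / 3 - 1 = 620 aa.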


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGTCGCT GGCGAATCTG CCTTTTGCTG TGGATGTTGG TGGGTTGTAG CGCAACCGCA 
CCGACAACCA GCACCCAAAT TCCAATTCAA CCAAGCTTGC CAACTCCAGC TCAAACTCTC
AGCCCGACTC AAGCATTGGC CACAAGCATC CCAATAATTG CCGATGAAGC GGCGGTGGAA
CGAGCCAGCC AACAAGGGCT TGAGCGCGAT TTAGCCCAAC TTGCCGTTGA TTGGCGCTTG
ATTAGCGAAA AACCTCAACC ACTGCGTTTG GAAATGCCCC CACCATACGA GCGCCGTAGC
TTTTGGGTTA CCGATTTAAC CAGCAATCAA CAGCGCAATA TCAGTGCCAC CCTTCAACTC
AGCACCACCC ACTTATTAAT TTATGTCGCC GATGATTTGC CGGTTGAGCA ACAAGCGCTG
ATTAATGCTG CTCAGCAATT TGAACAGGTT GGTTGGCCGT TGCTAGCCAA ATGGTATCCG
CAACAGGCTT GGCCCCAAGT ACCTGTAACC GTCTTGAATG CTGCGGTCAA CGGGGCGGGC
GGCTATTACG CCAGCGATAA CGAATTACCC CAAGCGATCA ATCCATATTC CAACGAACGC
GAGATGTTGG TGATTAACGC TGCGGCCATG CCACCCAGCG ATTTTGGCTA TGTCGCCACG
TTAATTCACG AAATGCAGCA TCTGTTGCAT CGGAATGTGC TGAGCCACCC CGCCACTTGG
CTCAACGAAG GCGCTTCGAT GTTGAGCGAA GATCGTTCAG GCTATAGCAA CGATAGCTTG
GCACTCGATT TTCTGGCCTC GCCGGATACC CAACTCAATG CGTGGGCCAG CAGCCCTGGC
ACTGCGCTCA AACATTATGG CGCGGCTCAG CTGTTCCTCA GTTACCTTGA TCAGCAACTC
GACGGCTTGC CGATGGGAAC CTTGGCGGCG GCTGATGCTG GCGATAATTT GACCAGCATT
ACCAGTTTGA TGACCACCCG CTATCCCGAT TTAACCAGCT TTGATCAGCT GTTTGCGGCG
TGGGCAGTCG CCAATTGGGT GAATGATCCA ACGGTGGCTG ATGGCCGTTA TGGCTACGAT
CTGCCCCGCG CTGTGTTGCC AGAGCAGGCC CAGAGCAGCG AACAAAACCT GAGCATTCGG
CAATTTGGCA GCGATTATTT GGCCTTTGAG AACGCCAGTA GCGAACGAAC GCTCGAATGG
CAGGGCAACA ACACAGTGCC TATTTTCGCC GCCGATGTGA CAAGTAGCGC CACATGGTGG
AGCGGGCGTG GCGATGCGCG GGTCAGCACG CTCACCACAG CAATTCAAGT GCCTAGCGCG
GGCGGCAGCC TGATTTATCG ACGGTGGTTT GATTTAGAGC AAGATTACGA TTATGCCTAT
CTCAGTCTTT CGCAAGATAA CGGCCAAACC TGGCAAGCAA TTGCGACCCA AGCCAGTACT
GGAGCCAATC CGGTTGGCTT GAATATTGGG GCTGGCTGGA CAGGCCAACA AACCACGTGG
CAAGCAGAAA GCGTTGATCT CACGCCGTGG GCAGGCCAAC AGATTCAATT GCGATTTTGG
GTGATCAACG ATGAAGCGTA TAATGCTGCT GGTTTAGCCT TGAGCGATCT GACAATCGAT
GGGGTAACGG CTGAATGGGT TGGGACTGGC TTTGTGCCAG TTCGTAATCA ATTGGCACAG
CGTTGGGTGC TCACGGCGGT GCTCTATGAT CAAGCTGGGG TTGCGGAAGT TGTCTCAATT
CCAACCGATA ATGGCCAAGC GCGTTGGCTG ATTCCGGCCA ATCGGCGAGC GGTTTTGGTG
GTCAATGCCA CAACTCAAGG CACCACCGAA GCAGCCAATT ACAGCTATAA CGTCACACCG
TAG
 
Protein sequence
MRRWRICLLL WMLVGCSATA PTTSTQIPIQ PSLPTPAQTL SPTQALATSI PIIADEAAVE 
RASQQGLERD LAQLAVDWRL ISEKPQPLRL EMPPPYERRS FWVTDLTSNQ QRNISATLQL
STTHLLIYVA DDLPVEQQAL INAAQQFEQV GWPLLAKWYP QQAWPQVPVT VLNAAVNGAG
GYYASDNELP QAINPYSNER EMLVINAAAM PPSDFGYVAT LIHEMQHLLH RNVLSHPATW
LNEGASMLSE DRSGYSNDSL ALDFLASPDT QLNAWASSPG TALKHYGAAQ LFLSYLDQQL
DGLPMGTLAA ADAGDNLTSI TSLMTTRYPD LTSFDQLFAA WAVANWVNDP TVADGRYGYD
LPRAVLPEQA QSSEQNLSIR QFGSDYLAFE NASSERTLEW QGNNTVPIFA ADVTSSATWW
SGRGDARVST LTTAIQVPSA GGSLIYRRWF DLEQDYDYAY LSLSQDNGQT WQAIATQAST
GANPVGLNIG AGWTGQQTTW QAESVDLTPW AGQQIQLRFW VINDEAYNAA GLALSDLTID
GVTAEWVGTG FVPVRNQLAQ RWVLTAVLYD QAGVAEVVSI PTDNGQARWL IPANRRAVLV
VNATTQGTTE AANYSYNVTP
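
The protein sequence can be rederived from the gene sequence, and the GC content recomputed, with a short script. This is a minimal sketch, not the method used by the source database: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the codon table is built from the amino-acid assignments of NCBI translation table 11, which match the standard code for internal codons. Alternative start codons are not handled specially; this gene begins with ATG, so that case does not arise here.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (NCBI translation table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence listed above.
print(translate("ATGCGTCGCT GGCGAATCTG CCTTTTGCTG"))  # MRRWRICLLL
```

Passing the full gene sequence to `translate` should reproduce the listed protein sequence, and `gc_fraction` on the same input can be compared against the GC content field.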