Gene Haur_0007 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0007 
Symbol 
ID5736841 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp8321 
End bp9760 
Gene Length1440 bp 
Protein Length479 aa 
Translation table11 
GC content53% 
IMG OID641277128 
Producthypothetical protein 
Protein accessionYP_001542787 
Protein GI159896540 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000446047 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGCGAA TCCGCGCTTC ATTCCGTCTC GCCATTGTTC TATTGGCAAT GTTGTTCCTC 
GTTGGCCCCG TTTTTGTAGC TCGGGCACAC GTTAGCCCAG CCCAAACCCA AGCTCCTGCC
ACTCCCAACG ATACTCAGTA TCTATGGATT GCCGGCAGTT CGTTTCAAAC CCGCGACTCA
ACCACGGCGT TTGAAACCAC TCGTAACGTT AATAATCAAC CAACTGGTTG TATTTATGCC
ACCAGCAGTG GCGAGTTTAC CGCCCCAGTT GCCGTGCCCA ATGGCGCAAC GATCTTGGGC
TTGGATTACT ATCTCTATGA TACCAGCACG ACGCAAACCA AAGCTGAGTT GACCCTCAAC
GACAGCGATA TGGTTACTCC CCGCGAATTG ATCACGGTCG AAATTTCAAG CTCGGTAGGC
CTGACCAACA CCTACGCCCC AATGGGTGCA CTCTATGCCC CGTATGTGGT CAATATGCAA
ACCCGTGGCT TGTTCTTAGA GTGGTTTCCC CGCGTAACCA ATGCGGCCAT GCAATTATGT
GGTGTGCGGA TCGCCTACAC CGCGCCAGCC GAACCACGTC CATCGACTGA TTATCTGTTT
ATCGTTGGCA GCACCTTGGT CAACACCAAC TCTAGCACCG AACACGGCTA TGCTGGGGCT
GGCTGTACCT TTGTCAAAAT CAACGGGCGC ACGCTCAATG CCGATGTGGA TTTGCCTCAA
GGCAGCCAAC TAACGGCGGT ACGCAGCTAT TTCCGCGATG TCAATAACGC TGATCTCACG
GTCAAATTAA TTGCCTCAAA TGGCCAAGGC GTGACCAATA CTCTGGCCAC CCTCACCAGC
CCGGTGTCGA ATACAGCGGT GGTGAATGCT GATCAATCGT TGAATTACAC GGTTAACGAA
AGCAGCGAAT CGTTGAGCGT TTTGGCCGAT TTTGGTGGGG TGTTGAGCAA CCAAATTCGC
TTGTGTGGGG TGCGTTTCCA ATATACCAAC CCCAGCGCTA AGCCAACCCA AGATAGCCGC
TTCATCACTG GGAGCACCTT CGTGCCCCGC CGCTCGAATG TCAGCTACAC GAGCGATGCC
AATGGTTGTG TCAATGTAAG CAACGAAGTT GAAGATTTGA CCACCAATGT AACTGCGCCC
GAAGGAGCCA AGGCTGCCCG CGTCACCTTC TACTACAAGA ATGCTGCGGC AGGCCCAACC
CTCAACCTCT ACAGCTTTGT TGGCAGCGGC GATTTTACGT CAATCACCAG TGTACCAGTG
ATGGGAACGG GTACTCAGAA CGCCGTCAAC ATCAATTATC CAATTGAAAA CGCCGAAAAA
GGCTTTGCCT TGGTGTGGGA TGCGTCGTCG GCAAGCAGTG AATATGCCTT GTGTGGTGCG
AAGATCGACT TCCTCTACAC CCAACAGGTC TTCTTGCCAG CCGCTATGAA CAACTACTAA
 
Protein sequence
MSRIRASFRL AIVLLAMLFL VGPVFVARAH VSPAQTQAPA TPNDTQYLWI AGSSFQTRDS 
TTAFETTRNV NNQPTGCIYA TSSGEFTAPV AVPNGATILG LDYYLYDTST TQTKAELTLN
DSDMVTPREL ITVEISSSVG LTNTYAPMGA LYAPYVVNMQ TRGLFLEWFP RVTNAAMQLC
GVRIAYTAPA EPRPSTDYLF IVGSTLVNTN SSTEHGYAGA GCTFVKINGR TLNADVDLPQ
GSQLTAVRSY FRDVNNADLT VKLIASNGQG VTNTLATLTS PVSNTAVVNA DQSLNYTVNE
SSESLSVLAD FGGVLSNQIR LCGVRFQYTN PSAKPTQDSR FITGSTFVPR RSNVSYTSDA
NGCVNVSNEV EDLTTNVTAP EGAKAARVTF YYKNAAAGPT LNLYSFVGSG DFTSITSVPV
MGTGTQNAVN INYPIENAEK GFALVWDASS ASSEYALCGA KIDFLYTQQV FLPAAMNNY