Gene Haur_3048 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3048 
Symbol 
ID5734920 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3849483 
End bp3850709 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content50% 
IMG OID641280192 
Producthypothetical protein 
Protein accessionYP_001545814 
Protein GI159899567 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.756675 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGACGAC GCATTCGTTG GCTACTGTTA ATCGGTGGAG TTGTTGGCCT CGGTTTGGCG 
GTTGGAATCG GCTGGTGGCA ACGCGATCAA CCAGCAGTTA AGCTCAATCG AACGCTCGTA
CAAGATCAAG CTGGCAATAT TCAACTAATC GATAGTCATA ATCAACAATT AACGTTGACA
AATGATGCCT CATCAATCGT GCAATATATT CAGGTTACGC CTGCACCCGA TGGCCAACAC
GTAGCCTATA TTCAACTAAC ACCCACAGTG ATCGAGATTC GAGTGCAGGC CTTCGATGGC
AGCCCAGCTC GCACGGTTTT CAGCGATTTT AATCTGCGCC CATTTTATCT TTCCTGGTCA
CCCAATAGCC AAATGCTGGC CTTTTTAGCA TCAGGCACAA CCATGGAATT ATATGTTGTG
CCCGCTGATG GCTCCGAAGT AGCGCATAAA GTGCGTGATG GACAACCATC GTATTTTGCT
TGGAAGCCTG ATAGCAGCGC TTTGTTATTG CACACTGGCG GCGGTACTCC GGTTGGCAAC
ACCGCAGTCC ACTCAGTTCA ATCCAAAGAT TTAACTTTTT TCAAGGAAAC CGCTGGCGAT
TTTCAAGCGC CTGCTTGGAA TGCTGATGGC TCAGCGCGGG TGGTGGTAGT AGCTGATGGC
GAAATCAATC AACTGATGCA GATTGATCAG GCCGGGCAAC AGGCCTTGAG CGAACCAACC
AGCGAAGGCT TTATGTTTGT GCTTTCGCCC GACCGTGCCA AAGTTGCCTA CCAAACCTTT
GGCCTACAAA CCCGCTCAGG TTTGATGATT CAAACGATTG CTACTGGCAA AAGCCAAAGT
TTCGAAACCG CCCGTCCCTT AGCATTTTTC TGGTCGCCAG ATGGGCGTTC GGTGGCCTTA
TTGGTTGCCG ATGCTCGGCC ACGCGGCCCC AGCGGCGATG CTGGAATTGT CAAAGTCAGT
CGCCAAGCCC AAAGCGGCGT GCAGGTGCAT TGGGAAGTAC TAGATGTCGA ATCGGGCCAA
GTTAAACGGC TCAAATCGTT TGTACCAAGT GGACCTTTTT TGAATGTATT GCCCTATTTC
GACCAATATG CCGCCTCGTT AACCTTCTGG TCGAGCGATA GCCAATATCT GCTCAACAAT
AGCAGCGATG GGGTTTGGCA AGTGCATGTT GAAACAGGCG CAGAACAGCA ACTAACCAAG
GGCGCATTTG GGGTTGCCGT GCCATAA
 
Protein sequence
MRRRIRWLLL IGGVVGLGLA VGIGWWQRDQ PAVKLNRTLV QDQAGNIQLI DSHNQQLTLT 
NDASSIVQYI QVTPAPDGQH VAYIQLTPTV IEIRVQAFDG SPARTVFSDF NLRPFYLSWS
PNSQMLAFLA SGTTMELYVV PADGSEVAHK VRDGQPSYFA WKPDSSALLL HTGGGTPVGN
TAVHSVQSKD LTFFKETAGD FQAPAWNADG SARVVVVADG EINQLMQIDQ AGQQALSEPT
SEGFMFVLSP DRAKVAYQTF GLQTRSGLMI QTIATGKSQS FETARPLAFF WSPDGRSVAL
LVADARPRGP SGDAGIVKVS RQAQSGVQVH WEVLDVESGQ VKRLKSFVPS GPFLNVLPYF
DQYAASLTFW SSDSQYLLNN SSDGVWQVHV ETGAEQQLTK GAFGVAVP