Gene Haur_3848 details


Gene Information

Locus tag: Haur_3848
Symbol:
ID: 5735713
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 4830315
End bp: 4832750
Gene Length: 2436 bp
Protein Length: 811 aa
Translation table: 11
GC content: 47%
IMG OID: 641281001
Product: NB-ARC domain-containing protein
Protein accession: YP_001546612
Protein GI: 159900365
COG category:
COG ID:
TIGRFAM ID:
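
The coordinates and lengths in the table above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 4830315
END_BP = 4832750
GENE_LENGTH_BP = 2436
PROTEIN_LENGTH_AA = 811


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # subtract the stop codon


span = span_length(START_BP, END_BP)
print("span matches gene length:", span == GENE_LENGTH_BP)
print("protein length matches:", expected_protein_length(span) == PROTEIN_LENGTH_AA)
```

Under those assumptions, 2436 bp corresponds to 812 codons, one of which is the stop codon, which agrees with the listed 811 aa.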


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAATGT GGAATGAACG CACCGTGCAA GATGTGTTAC AACGCCCAGA ACGCTTGCTT 
AGCCAAACTG ATTGGCGGCA CATAATTCAG CAACAGGGTG GATTAACTGC GTTTTACCAA
CAATTGCAAC AACTGCCACT TGAAGCCAAT CAACAAGCGG TGCTGAACGT TTTGACGACC
TATCCAGGTG CGCCAGTTGA AACCTATTGT TCATTGCTGA ATGTGCACAA AGCGACCTAT
CATCGCTATC AAAAAGCCTT GATTCAACAA CTAACCAGCT TGCTCAATAA CGAGCAACCG
CAAGCACACA CGCCAACCCA AGCGCCGTCG TTGCACCAAC TGCGCCCGAT TTTGGCCGAT
TTTGTGGGCC GTACAGCCGA ACTCAAACAA GCTCACTATG CGATCGATAT CGCCCACAAT
GCCGCACAGG GTGCGGTAAT TAACGGAATT CAAGGCATGG GCGGGGTTGG AAAAACTGAG
CTAGCAATCT ATTTGGCGCA TCAGTTGATT CCGCATTTTC CTGATGCCCA AATTGTGCTC
AATTTGTATG GCTCGCGCGA GCAACCGCTG ACGATTGAGC AAGCACTGGG CACGGTGATT
GCCCTGTTTA AGCCGAATGC CAAATTGCCT GAACAACGCG AAAAACTGCT TGAAATCTAT
CATGAGGTGT TGGCTGATAA GCGGGTATTG ATTTTGGCCG ACGATGCGCG AGATTTGGCC
CATGTCCAAG ATTTAACCCC GCCAGTTGGT AGTTGTTTGT TGGTCACTAG CCGCTTGCGG
TTTGCGATGC CGCTGATGGC GCAACTGCAT CTGACCGAGT TTCAGGAGCC TGAGGCCATC
GCTTTGCTCC AGCAAATTTG CCCACGACTT GAGGCTGAAA CTGCCCAGCA ATTGGCCGTT
GCCTGTGGTT ATCTACCCTT AGCTTTGCGC ATCAGTGCCA GTATTTTGGC CCAAAACCCT
GAGTTAGCGG TTGCCGAATA TCTGATCCAA CTGCGCGATC AACAGCAACA ACTCGCCGCC
TTGGAATACC CCGATGATCC GCAGGCCAGC GTGGCAGCAT CGTTGGCCTT GAGTTACGCC
CGTTTGCCCA GCGAATTGCA AGCCTTGGCG CGTCAACTCA GCCTGATTGT GGCCGACTTT
AGTAGTGCCA TGGGTTTAGC AACAGCAGGG CTGGATTTCA ACATGGCCAA CGAAAATTTG
CTGTATAAAT TGGCCTTGCA CAACTTGATT CAATTTGAGC ATCGCCAAGA GCGCTGGCGC
ATGCACGACC TCGTTCGCAG CGTGCTACGC CGTTATTTGG ACGAAGCAGA ACAAACTCAA
ACTCTATTGA ATTATGCCCA AGCCAGTGTT GAAACTCTCA AAATTATTTA TCAGGATTTT
CGAGCAGGCG GAGCTACTCA AACTAAAAGC ATTGATAATT TTGATCGCGA ATATGCTCAT
ATTGTAGCAA TTTGGCAATG GGCGCAACAA CAACCGATCT CGCCAGTAAT AGATAAAATC
GTGGTTGAAT TAGGATTTTC CAGTGGTGGA GTTAGTCGGA TACGAGTTGG TCGTCGCTAT
AGCACTTTAG CCGAACATGA GTTTGGGTTT GAGGCAGCTC TGCGTATTCA AGAGCTTTAT
AAAGCAGCAA TATTTGCTGG AGCACTTGCT AATAGATATC TCGCTCGTGC AGAGTATAAA
ATATCACTCG GGTGGCATGA ACGAGCTTAT GCGCTTGCTC TTGAGATCAA CGATCTTTAT
TTACAATCGT TATTTTTAGG TGATATGGCC ACATGTTATA GCCAAATGGG TGGTGATAAA
CATCTGGATA AAGCCCTGGA CTTAGAGCGC GAGGCACTAC GATTGTTTAG GTTAAGCGGG
TATGAGGGTT CAGGTGAAAG CCTCAGAGTG AATAATCTCG CAACAAACCT TGCATTGCTT
GGATACCATG AAGAAGCTGC TGAATATTTT ATTGAAGCTG TTACTATCGC TCAAAAATCT
GAGAACCAAG CTGATGAGTG TCGTGCACTC TATAATCTCG GTGAAACGTA TCTCAAATTG
AACCAACTTG ACCAAGCCCA AATAGCTTTT GATCAAGCAC TTACTATTGT TGAGCGCATG
AATTTTGATG AAGGCCGCGC TTACATGTTG CAAGGCCAAG CAAATGTAGC GATGTTGCAA
AAAAACTATC ACCAAGCAAT CCAAAGATTC AATCAAGCAT ATGCACTCAT GCAGCATTAT
AATCGGACTA TTGCGTTAAA CATTCAATGG AAAATCGGCC TCCTGTATTG GAAGCTTAGT
GATGTGTTAG CAGCTGAAGC CCAAATGCAA GCAGTGCTTG AACAAGAACG CCCATTGGGG
ATTGATCGGG TGCAAGATCA TGAACTACAA CTGAGCAATT TGCGCAATCG CCAACCCTTT
GATGATAGCC TGTTGGTATC GATGCTCAAA GAATAG
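
The gene sequence above is printed in space-separated blocks of 10 bases, so the whitespace has to be removed before any computation. The following is a minimal sketch of how one might do that and compute a GC fraction. The helper names are hypothetical, and only the first line of the sequence is used as sample input, so its result is not expected to equal the 47% reported for the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block separators, line breaks) and uppercase."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied as printed (blocks of 10 bases).
first_line = (
    "ATGACAATGT GGAATGAACG CACCGTGCAA GATGTGTTAC AACGCCCAGA ACGCTTGCTT"
)

seq = clean_sequence(first_line)
print(len(seq), "bases, GC fraction:", round(gc_fraction(seq), 3))
```

To check the listed GC content, the same two functions could be applied to the complete sequence text pasted in as a single string.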
 
Protein sequence
MTMWNERTVQ DVLQRPERLL SQTDWRHIIQ QQGGLTAFYQ QLQQLPLEAN QQAVLNVLTT 
YPGAPVETYC SLLNVHKATY HRYQKALIQQ LTSLLNNEQP QAHTPTQAPS LHQLRPILAD
FVGRTAELKQ AHYAIDIAHN AAQGAVINGI QGMGGVGKTE LAIYLAHQLI PHFPDAQIVL
NLYGSREQPL TIEQALGTVI ALFKPNAKLP EQREKLLEIY HEVLADKRVL ILADDARDLA
HVQDLTPPVG SCLLVTSRLR FAMPLMAQLH LTEFQEPEAI ALLQQICPRL EAETAQQLAV
ACGYLPLALR ISASILAQNP ELAVAEYLIQ LRDQQQQLAA LEYPDDPQAS VAASLALSYA
RLPSELQALA RQLSLIVADF SSAMGLATAG LDFNMANENL LYKLALHNLI QFEHRQERWR
MHDLVRSVLR RYLDEAEQTQ TLLNYAQASV ETLKIIYQDF RAGGATQTKS IDNFDREYAH
IVAIWQWAQQ QPISPVIDKI VVELGFSSGG VSRIRVGRRY STLAEHEFGF EAALRIQELY
KAAIFAGALA NRYLARAEYK ISLGWHERAY ALALEINDLY LQSLFLGDMA TCYSQMGGDK
HLDKALDLER EALRLFRLSG YEGSGESLRV NNLATNLALL GYHEEAAEYF IEAVTIAQKS
ENQADECRAL YNLGETYLKL NQLDQAQIAF DQALTIVERM NFDEGRAYML QGQANVAMLQ
KNYHQAIQRF NQAYALMQHY NRTIALNIQW KIGLLYWKLS DVLAAEAQMQ AVLEQERPLG
IDRVQDHELQ LSNLRNRQPF DDSLLVSMLK E