Gene Haur_3320 details


Gene Information

Locus tag: Haur_3320
Symbol:
ID: 5735190
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 4184330
End bp: 4186093
Gene Length: 1764 bp
Protein Length: 587 aa
Translation table: 11
GC content: 52%
IMG OID: 641280467
Product: DNA repair protein RecN
Protein accession: YP_001546084
Protein GI: 159899837
COG category: [L] Replication, recombination and repair
COG ID: [COG0497] ATPase involved in DNA repair
TIGRFAM ID: [TIGR00634] DNA repair protein RecN
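
The start, end, gene length, and protein length values above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 4184330
END_BP = 4186093
REPORTED_GENE_LENGTH = 1764      # bp
REPORTED_PROTEIN_LENGTH = 587    # aa


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))  # prints "1764 587"
```

Under these assumptions the computed values agree with the reported 1764 bp and 587 aa.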


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTGCTAG AATTATTTAT TGCCGATTTC GCAATTATTG ACCAAGTACG CCTGCACTTT 
ACCCCTGCCT TTAATGTGCT GACGGGTGAA ACCGGAGCCG GTAAGTCAAT TATGATTGAT
GCACTCGGCA TGTTACGGGG CGAGCGGAGC GATCCCAGCT TTGTGCGAGC GGGAAGCAAT
CAAGCCCGGG TTGAAGGTAT CTTCACCTTG GCTGATCGCC CTGATATTCT GCCTATTTTA
GCCGAATATG GCCTTGATGG TGCTGATGAT GATCAGATTA TTCTGACCCG TGAAATTCAT
GGAGCCAGCG GGCGTAGCGT TGCTCGAATT AATGGCCGCG CCGTGAGTAG TGCTGTGCTC
CGTGATATTG GCGGGCGCTT GGTCGATATC CACGGCCAAA ACGATAGCCA AACCTTGTTC
AATGTACGCA CCCACGCCGA AATGCTCGAT CGCTATGCTG GAGTTGTCGC TGATCGCGAA
CAACTGAGTC AGCAGGTGAT CGCGATTGAA GCCGTGCGCA GCCAAATTAG CACCTTGCGC
AATGCCGAAG CGCGTCGCCT CGAACGGATC GAAGAATTGA CCTTTTTGGT TGAGGAATTG
ACCAACGCCA AGTTGATCGC TGGCGAAGAG GCTACGTTAA CCAACGAACG CGGTTTATTA
CAAAATAGTG CTAAAATCAC GGGCACGGTT GATACGATCT ATCGTTTGTT GCGCACTGGC
ACGCCAGCCA GCGAACGGCG TTCAGCCACC CGTTCAATTG TCGATAGCCT TGATGATGTG
GCTAATTTGT TGAGCGAATT GTTGCGGCTA GACCCAAGTT TGGCTGGATT GAACGAGCAA
ACCCTTGAAG TGCGCTATCG ACTTGACGAT GTGATCGAAG GCGTGCGGGT CTATCGCGAT
CGGCTGGAGT TTGAGCCAGG CCGTCTCGAA GTGATCGAAG ATCGTTTAGC TGAGTTGCGC
GATTTAGCCA AAAAATACCG TGCTGCCGAT GCTGCCGAAT TGCTCGAACG CTTGACCAGT
GCCAGCGATG AACTGGAAAC CTTGCACTAC AGCGCTGAAC ATATTGCCGA ATTGGTGCAA
CAAGAACAGC AATTGTTGGC AAGCATTGGG CTAGCTGCCG CTGAACTGAG CCGCCGTCGT
CGCCAAGCAG GCGATGAATT GGCTGGGCGG ATTGCCGCTG CCATGAGCGA TTTAGCCATG
CCGCATGTTA AATTTCATGT GCAAGTATCG CAGCGCAGCG ACCCACAAGG CGTATTGATC
GATGATCACT ATCTCGCCTT TGATCGCACG GGAGTTGATC AGATTGAGTT TTTACTCAGC
CCCAACCCTG GCGAGCCACT CAAACCGCTG GCCAAAATTG CCTCTGGTGG TGAATCGGCA
CGCTTGCTCT TGGCGATGAA ATCAATTCTT TCAGCAGTTG ATAGTGTGCC AACCTTGGTT
TTTGATGAAG TTGATGTGGG AGTTGGGGGA CGGGCTGGCC ATGTGGTCGG CGAAAAATTA
TGGGGCATTA GCGATGCCCA TCAAGTGTTG TGTATTACCC ACTTGCCTCA AGTTGCCGCT
TTTGGTGATT GCCATTTTGC GATTGCCAAG CAAGTTATTA ACCAACGCAC CCAAACCTTT
GTGCAACCAC TCAGCGAACA AGAACGCATC GAAGAACTAG CGGCGATGCT TGATGGAACA
CCAGTGAGCG AAGCGAGCCG TCGCTCGGCC AGCGCCATGC TCGAACGGGC TGCCAACTAC
AAACTGGCAA CCAGCAACCC ATAA
 
Protein sequence
MLLELFIADF AIIDQVRLHF TPAFNVLTGE TGAGKSIMID ALGMLRGERS DPSFVRAGSN 
QARVEGIFTL ADRPDILPIL AEYGLDGADD DQIILTREIH GASGRSVARI NGRAVSSAVL
RDIGGRLVDI HGQNDSQTLF NVRTHAEMLD RYAGVVADRE QLSQQVIAIE AVRSQISTLR
NAEARRLERI EELTFLVEEL TNAKLIAGEE ATLTNERGLL QNSAKITGTV DTIYRLLRTG
TPASERRSAT RSIVDSLDDV ANLLSELLRL DPSLAGLNEQ TLEVRYRLDD VIEGVRVYRD
RLEFEPGRLE VIEDRLAELR DLAKKYRAAD AAELLERLTS ASDELETLHY SAEHIAELVQ
QEQQLLASIG LAAAELSRRR RQAGDELAGR IAAAMSDLAM PHVKFHVQVS QRSDPQGVLI
DDHYLAFDRT GVDQIEFLLS PNPGEPLKPL AKIASGGESA RLLLAMKSIL SAVDSVPTLV
FDEVDVGVGG RAGHVVGEKL WGISDAHQVL CITHLPQVAA FGDCHFAIAK QVINQRTQTF
VQPLSEQERI EELAAMLDGT PVSEASRRSA SAMLERAANY KLATSNP
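
The sequences above are displayed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch of how the blocked gene sequence could be normalised and checked against fields such as gene length and GC content. The helper names are hypothetical, and the CDS check is simplified: it only accepts ATG as a start codon, although translation table 11 also permits alternative starts.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str, stop_codons=("TAA", "TAG", "TGA")) -> bool:
    """Simplified check: length divisible by 3, ATG start, recognised stop codon."""
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in stop_codons


if __name__ == "__main__":
    # First display row of the gene sequence above, used only as sample input.
    first_row = "ATGCTGCTAG AATTATTTAT TGCCGATTTC GCAATTATTG ACCAAGTACG CCTGCACTTT"
    print(len(clean_sequence(first_row)))  # prints 60
```

Pasting the full gene sequence into `clean_sequence` and passing the result to `gc_fraction` and `looks_like_complete_cds` would allow comparison with the reported gene length and GC content.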