Gene Haur_0810 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0810 
Symbol 
ID5732710 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp917439 
End bp919460 
Gene Length2022 bp 
Protein Length673 aa 
Translation table11 
GC content52% 
IMG OID641277941 
ProductDNA ligase, NAD-dependent 
Protein accessionYP_001543586 
Protein GI159897339 
COG category[L] Replication, recombination and repair 
COG ID[COG0272] NAD-dependent DNA ligase (contains BRCT domain type II) 
TIGRFAM ID[TIGR00575] DNA ligase, NAD-dependent 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCGTGT CAGAGCAGAC GGTTGCCCGC GCCGCAAGCT TGCGCGATGA ATTGAATCTA 
TACAATCATC ATTATTATAC GCTTGATGCA CCGCTGGTCA GCGATGCTCA ATACGATAGT
TTATTAAATG AATTGCGGGC GATTGAGGCC GAATATCCCG AATTACGCAC CCCCGATTCG
CCGACCCAAC GGGTTGGTAG TGCTCCGTTG AGCAAATTTC CCAAAGTGCA GCACCCTGTG
CCAATGTTGA GCCTTGGCAA TGCCTTTAAT GCCGATGATT TGGCCGCGTG GCGACGACGC
GCTGAACAAA TTATTGGTAC GCAGCCGATG AGCTATACCG TTGAGCCAAA AATTGATGGC
TTGGCCGTGG CATTAACCTA TATTAATGGG GTATTTAGCG TTGGCGCAAC CCGTGGCAAC
GGCGAAATTG GCGAGGATAT TACCGCCAAC CTACGCACAA TTCGCGATGT GCCCTTGCGG
CTGCAACCAA TCGACGGCCA AGCCTTGCCC GAACGCATCG AAGTGCGTGG CGAGGTCTAT
TTGCCTATCG AATCGTTTAA TCAATTGAAT GAACGCCAAG CCCATGCTGG CGAAAAAGTC
TTTGCCAATC CACGCAATGC TGCCGCTGGA TCGTTGCGTC AGCTCGATTC AACGATTACT
GCTAGCCGTC CGTTGCGCTT TTTTGCCTAC GCTGTCGGCC CTTTCAGCGG CGTTGAACTC
AAAAGCCAAG CCCAAACCCT TGATACCTTG CGCACTTATG GGTTTAGCGT TAATCCCGAT
ACGCGGCTTT TTGCTGATTT TGAGGCGGTA ATCGAATATT GCCACGAGTG GATGAGCCGC
CGTGAATCGC TAAGCTACGA AGTTGATGGC GTGGTAGTTA AAATTAATGA TTTTGCCATG
CAACGTGAAT TGGGCGTGGT TGGTCGTGAT CCACGCTGGG CGATTGCCTA TAAATTTCCA
GCTCGCGAAG AAACCACCAC CTTGCTCAAT ATTGTGATCA ATGTTGGTCG CACTGGTAAA
TTGATTCCCA ATGCTGTGCT CGAACCTGTC AGTTTGGGCG GCACGACGGT GCAGCATGCC
TCGTTGCACA ACGCCGATTA CATCATCAGC CGCGATATTC GCATTGGCGA TCGGGTTGTG
GTCAAACGGG CTGGCGATGT GATTCCCTAT GTGATTGGGC CAATCGTTGA GGCTCGCACT
GGCGACGAGC GAGTTTGGCC AGCGCCAACT CATTGTCCAA CTTGTGGTCA GCCAGTCGAG
CAAATTGGCG ATGAAGTTGA TATTTATTGC GTCAATAATA CTTGTCCTGC GCGTTTGATT
CGTTCAATCG AACATTGGGT CAGCCGTGGC GCGATGGATA TTGTGGGCAT GGGCGAGCGC
CAAGCCAGCC AATTTGTCGA AATGGGCTTG ATCAAATCGA TTCCTGATAT TTATCGTTTG
ACGGTTGATA GCTTTGGGGG GCGTGAAGGC TATGGCGAAC GGCGCGTCGC TAATTTGCTG
AATGCGATCG AAGAATCCAA GCGACGCCCG CTTGATCGTG TCATCACCGC TTTGGGGATT
AACGGAGTTG GAACGGTGGC GGCGGCGGAT TTAGCCCGCT ATTTCCGTTC ATTGCCAGCC
TTAGCCCAAG CCACGATTGA GCAATTGACC GCGATTGAGG GGATTGGTGG CAGCACCGCC
CAAAGCGTGG TCGATTTCTT CAATACGCCA GCCAACCAAC AATTAATCGC CGAATTATTG
GCTTTAGGCC TCAAAGCCGA GCCTAGCGAA GTTGCTGAAT TGCAGAGTGA TCGTTTGGCG
GGCAAAAGTT TTGTGATCAC TGGAACCTTG CCTGGCATTA GCCGCGAAGC CGCTCAAGCC
TTGATCGAAG CCCATGGCGG CAAGGTTGGC GGTAGCGTCA GCAAGAAAAC TGATTATTTG
CTGGCAGGCG AGGCAGCTGG CTCGAAATTG ACCAAAGCCC AAAGTTTAGG CGTAAAAGTG
CTGAGCATGG ATGAGTTGCA TGCGCTACTG GTCGATGAAT AG
 
Protein sequence
MAVSEQTVAR AASLRDELNL YNHHYYTLDA PLVSDAQYDS LLNELRAIEA EYPELRTPDS 
PTQRVGSAPL SKFPKVQHPV PMLSLGNAFN ADDLAAWRRR AEQIIGTQPM SYTVEPKIDG
LAVALTYING VFSVGATRGN GEIGEDITAN LRTIRDVPLR LQPIDGQALP ERIEVRGEVY
LPIESFNQLN ERQAHAGEKV FANPRNAAAG SLRQLDSTIT ASRPLRFFAY AVGPFSGVEL
KSQAQTLDTL RTYGFSVNPD TRLFADFEAV IEYCHEWMSR RESLSYEVDG VVVKINDFAM
QRELGVVGRD PRWAIAYKFP AREETTTLLN IVINVGRTGK LIPNAVLEPV SLGGTTVQHA
SLHNADYIIS RDIRIGDRVV VKRAGDVIPY VIGPIVEART GDERVWPAPT HCPTCGQPVE
QIGDEVDIYC VNNTCPARLI RSIEHWVSRG AMDIVGMGER QASQFVEMGL IKSIPDIYRL
TVDSFGGREG YGERRVANLL NAIEESKRRP LDRVITALGI NGVGTVAAAD LARYFRSLPA
LAQATIEQLT AIEGIGGSTA QSVVDFFNTP ANQQLIAELL ALGLKAEPSE VAELQSDRLA
GKSFVITGTL PGISREAAQA LIEAHGGKVG GSVSKKTDYL LAGEAAGSKL TKAQSLGVKV
LSMDELHALL VDE