Gene Acid345_3679 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3679 
Symbol 
ID4072282 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4349498 
End bp4350913 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content63% 
IMG OID637985702 
Productglutamate synthase (NADH) small subunit 
Protein accessionYP_592754 
Protein GI94970706 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0493] NADPH-dependent glutamate synthase beta chain and related oxidoreductases 
TIGRFAM ID[TIGR01317] glutamate synthases, NADH/NADPH, small subunit 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.15931 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTAAGA CCACCGGGTT CCTCGAATAC ACCCGCGAAC TCCCGCAGCG CCGGCCCGTG 
ATCGAGCGCA TCAACGACTG GTTCGAGATC TACCAGGACT TCCCTGAGCA GAGCCTTCAA
GAGCAGGGCG CACGCTGCAT GGATTGCGGT GTGCCCTTCT GTCACACCGG ATGCCCGGTC
AACAACAACA TTCCCAACTG GAATGACCTC GTGTACCGCG GACGCTGGCG CGAGGCGGTT
CGCCGTCTCC ACGCCACGAA CAATTTTCCG GAGTTCACCG GACGCATCTG CCCCGCGCCC
TGCGAAGCTT CCTGCGTTCT CGGCATCAAT CAGCCGCCGG TCACCATCAA GCAGATCGAG
AAAACGATCA TCGAGCGTGC CTTCGCCCAA GGATGGATCA ACCCCGAGCC GCCGAAGGCC
GAAACCGGCA AGAAGGTCGT GGTCATCGGC TCAGGCCCCG CTGGCCTCGC AGCCGCGCAG
CAACTCCGCC GCGCCGGACA CACCGTCACC GTCTACGAGA AGGCTGACCA CGTTGGTGGT
CTGCTTCGCT ATGGCATCCC GAACTTCAAG CTCGAGAAGC ACGTTGTGGA TCGTCGCGTC
GCCCAGATGG AAGCCGAGGG CGTTCAGTTC GTGACCAGCG CACACGTCGG CGTCAACGTC
TCGGTCGAGC AGCTTCGATC GGAACACGAT GTGGTGTTGC TCTCCGGCGG CGCTGAGCAG
CCGCGCGACC TCTCCGTTCC CGGCCGCGAA CTCAAGGGTA TTCACTTTGC GATGGAGTTC
CTGCCACAAC AAAACCGGCG TAACTCCGGT CTAGCTGTGA ACGACAAAGA GATCCTCGCG
ACGGGAAAGC ATGTCGTCAT CATCGGCGGC GGCGATACCG GAGCCGATTG TCTCGGCACT
TCGCATCGCC AGGGTGCAAA GTCTGTCCGT CAATACGAGA TCATGCCGAT GCCGCCTCAG
GAGCGCGCGG GGCAGACGCC GTGGCCGCTA TGGCCGCTGC AACTGCGTAC CGAGAGCTCG
CACGAAGAAG GCGGCGATCG CCAGTGGTCG GTCGCAACCG CGCAATTCAC CGGCGATGAA
CATGGCAACG TGAAACAACT GCACGGCGTT CAGATCGGTC CACCGCCGAA ATTCGAACCG
ATCGCCGGCA CCGAGTTCGT CGTGGAAGCC GATCTCGTGT TGCTTGCGAT GGGCTTCACC
GGCCCGGTGC GCAACGGCAT GATCGAGCAA CTCGGCGTTG CCCTCGATCC GCGCGGTAAC
GTGCAGACCG ACAACAACTC CATGACCTCG GTCACAGGCG TCTTTGCCGC GGGCGATATG
CGACGCGGCC AATCGCTGGT GGTATGGGCC ATCGCCGAAG GCCGTAAGGC CGCTGCGGGC
ATCGACGCAT ACCTCCACGC GAACGGCGCC TCCTAA
 
Protein sequence
MGKTTGFLEY TRELPQRRPV IERINDWFEI YQDFPEQSLQ EQGARCMDCG VPFCHTGCPV 
NNNIPNWNDL VYRGRWREAV RRLHATNNFP EFTGRICPAP CEASCVLGIN QPPVTIKQIE
KTIIERAFAQ GWINPEPPKA ETGKKVVVIG SGPAGLAAAQ QLRRAGHTVT VYEKADHVGG
LLRYGIPNFK LEKHVVDRRV AQMEAEGVQF VTSAHVGVNV SVEQLRSEHD VVLLSGGAEQ
PRDLSVPGRE LKGIHFAMEF LPQQNRRNSG LAVNDKEILA TGKHVVIIGG GDTGADCLGT
SHRQGAKSVR QYEIMPMPPQ ERAGQTPWPL WPLQLRTESS HEEGGDRQWS VATAQFTGDE
HGNVKQLHGV QIGPPPKFEP IAGTEFVVEA DLVLLAMGFT GPVRNGMIEQ LGVALDPRGN
VQTDNNSMTS VTGVFAAGDM RRGQSLVVWA IAEGRKAAAG IDAYLHANGA S