Gene Acid345_3947 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3947 
Symbol 
ID4071330 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4668019 
End bp4669422 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content55% 
IMG OID637985973 
Producthypothetical protein 
Protein accessionYP_593021 
Protein GI94970973 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3420] Nitrous oxidase accessory protein 
TIGRFAM ID[TIGR03804] parallel beta-helix repeat (two copies) 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.917121 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.453465 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGTGAAA GCAGAAACGC AATGGGCCGA ACCATGGGGT TGTTCGCAGT TGTCACTTTA 
CAGTTCGTCA TGCTTTGGGG GAGCAGGGCG ATTGCCAGCA CAGTCACACT TAAACCGGGC
GAAAACGTGG CCGCAGCGGT CGCGAACGCT CCTGCAGGGT CAACCTTCGT ATTCACTCCG
GGTACGTATC GCATGCAATC GATCATTCCG AAAGACAACG ATATCTTTGT CGGGCAATCA
TCGACCGGCG TCATCCTGAA TGGCGCCAAG GTTCTAACGA TGGAACCGAA TGGCAAGTAC
TGGACGAAAA TCGAGCCGCT GAATCCAACG GTTTACGTCG CAAACCATTG CAATCCGGGA
CACGCGCGTT GCTACATCCT GAACGATCTG TTCATTGATG GAAAGTTGCA ACGCCCGGTG
AGTTCGCTCA GCAGTCTGGC GGCGGGACAC TGGTATTACA ACCTCACCAC GGGAACGATT
TACATCAGCA CCAATCCGGC TGGACACGTG GTGGAGTGGG CCTATACCAC GTATGCCTTC
CGGGGAGCCG CGACCGGTGT GCAAATCAGC TTTCTCACCG TGAAGAACTA TGCGACGCCC
CCGCAAGCGG GTGCGATCGG AGGGCCGAAC GGCAAGGCGG AGCATTGGTA CATCCACAAC
GTGAATGTCA TGCACAATCA CGGGGCCGGG ATTGCCATCG GAAACTACAG CAAGGTGATG
TACTGCAACT CGTCGAGCAA CGGCCAGGAG GGGCTTGCCG GCCACGGCGC GTATATCACG
ATTGAACACA ACACCTTCGC CTACAATAAC CAGGCCAGCT ACATGAACTT CTGGGAAGCG
GGAGGCGCAA AAGTCACGGA TACCAGCCAT TTGCTGCTTG GCTACAACTA CGTTCACGAC
AACCTCGGCA CGGGATTGTG GGAAGACATG TACAACACTG ACTCGGTCGT CGAAAACAAC
ACCAGCATTA ACAACCTGGT GGGTATTGCC GAAGAGTTCG CGTCGAACCT GACGCTCAGG
AACAATGTCG TGCGCGGCAA CAGGAAGATG GGCATCCTGA TTTCGCTTTC CCGGTATGCG
GAGGTCTATG GCAATACCGC GGAAGTTCCG GTGAACGGGA TTGACGCGAT CCGGGTCGCG
GAAGGTCAAC GCGACGGGAT GAACACCCAC GACGTTCACG TGCACGACAA CATCATGATC
TTCGACGGAA CAAAGTCGGG TCGCACTGGG CTCTCAGGAA ATCTCGATAC CGCGACCAAC
GTGACTTTCA ACAACGACAA GTACTACAAG AAGAACGGTG GGTACTATCA CTGGTTGTGG
GGCGGATCCA CCTGGATTTC CTTCACTGCC ATGCAGAAGG CCGGACAGGA GTTGACGGGA
ACCGTTTCGA CCGGTGCGCC GTAA
 
Protein sequence
MCESRNAMGR TMGLFAVVTL QFVMLWGSRA IASTVTLKPG ENVAAAVANA PAGSTFVFTP 
GTYRMQSIIP KDNDIFVGQS STGVILNGAK VLTMEPNGKY WTKIEPLNPT VYVANHCNPG
HARCYILNDL FIDGKLQRPV SSLSSLAAGH WYYNLTTGTI YISTNPAGHV VEWAYTTYAF
RGAATGVQIS FLTVKNYATP PQAGAIGGPN GKAEHWYIHN VNVMHNHGAG IAIGNYSKVM
YCNSSSNGQE GLAGHGAYIT IEHNTFAYNN QASYMNFWEA GGAKVTDTSH LLLGYNYVHD
NLGTGLWEDM YNTDSVVENN TSINNLVGIA EEFASNLTLR NNVVRGNRKM GILISLSRYA
EVYGNTAEVP VNGIDAIRVA EGQRDGMNTH DVHVHDNIMI FDGTKSGRTG LSGNLDTATN
VTFNNDKYYK KNGGYYHWLW GGSTWISFTA MQKAGQELTG TVSTGAP