Gene Acid345_4122 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4122 
Symbol 
ID4072313 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4883375 
End bp4884901 
Gene Length1527 bp 
Protein Length508 aa 
Translation table11 
GC content62% 
IMG OID637986153 
Producthypothetical protein 
Protein accessionYP_593196 
Protein GI94971148 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTTTCG CAGCTCTGGC CCTATCGCTT GTCCTGGCTC TTCCACTCGT AGCCCAAACT 
GGTTTCTCAA CCCACACGTA CGCCGCGCCG GTGGTCCAGC ACCTCATCCC GGTGGACGTC
AACGGCGACG GCTTTCCCGA TCTCGTCACC TATGGCGGCC CGGTGGACGT CTATCTCAAC
GATGGCCATG GTGGCATGCT TCCGCGCAAG TACATCGGCG TATTTGCGGC GTACGCCGCA
GTCGCGCAAT TCGCTGGCGA CGCACTCCCC GAAATCGCGA CCTGCCACAT CAACAGCGGC
AATACAGCTT CCACCATCAG CATCTATATC AACCACGGCA GCGGGAATTT CTCGCTCGCC
GGAAGCGCGC CGCTGGATGG CATTTGTACC AGCCTGAGCG TCGGCGATGT CGATGGCGAT
GGCAAACTCG ATGTCGTTAC GACTTCGTAC GCGCACGACG CGAGTTCCAA CATTACGGGC
ACGGCGATCA CCACCTTCTT CAGCAACGGC TTGGGAAGCC TCAATCGCTC CGTCACGCAA
GCGAATCCCG ACGTCAGCGC GCAAAACGAT CCGACGAACA TTTTTGATTG CCACCTCAGT
TCCGCGACCG GCGGCGACTA TGAGCAGGCC GGGCGGCTTG ATCTCGTCCT GATCGGGGCG
TGCCAGTCGG GGGCCACGAA TGCCGGAACG ATCTTCTACG GCACCAGCGA CAAGGCCGGC
CATTACTCGC TGAAGGAGAT CAAAGAGGAC GAGCGAGGCT TCGACTACTT CCCGCCGTAC
ACCGCCGACG TGAACGGCGA TGGAACCCGG GACGTCGTGC TCATCGACTA CCAATCCGGC
CCGCACAACT CGTGGAGCAA CAGCCTCGAT TTCCTCGAGA ACCACCTCAA CGGCCAGTGG
ACGCTCAAGC AAGTCTTCAC CGAGAGTTCG TACGCGGCGA GCTACCTGAG CGCGGTCTTT
TCCGGCGCCG GCGCAGACTT CATCAACGGC GGTAGCTGGG AAGCAGTCGC GGGCTTCACC
CAATCACCAG ATTGCTGCAC ACAGGACACG CCGGGGATCG CAATCCTCAC CCAGGACACC
TCCGGCAACT ACGTTGAGTC GCAGCGCTGG ACAGCCAAGG CGTACCCGTA CGCGACGGTA
GCTGCGGACT TCAACAAAGA TGGGAAGCCG GACATCGCAA CCGTGCAGGT GAACAGCAGC
GCCAACACGG CTTCGCTGGT CCTGTATCTC AACTCCGGCG CCCCCGCGTG CCCCGCGCCC
AGCAGCCCCG GCGTGCATCT CTGTTCGCCC GTAGCAGGCA ACACCTACAC GTCGCCGGTG
AACGTGCGGG CGACCGGCAA AGCCGCCAGC GGCAGCGTTG TGCGACTCGA GTTGTGGGTT
GACGGCAAGA AATACGGCAA CTTCAGCGGT TCCACGATGA ACGCGAATGC GTTGCTGGGC
TCCGGCTCAC ACCGCATCGT AGTAGTCGAA GACGACTCCA CCGGCGGCCA CCTGAACTCC
ACTCCCGCCT ACATCACCGT GAATTAA
 
Protein sequence
MRFAALALSL VLALPLVAQT GFSTHTYAAP VVQHLIPVDV NGDGFPDLVT YGGPVDVYLN 
DGHGGMLPRK YIGVFAAYAA VAQFAGDALP EIATCHINSG NTASTISIYI NHGSGNFSLA
GSAPLDGICT SLSVGDVDGD GKLDVVTTSY AHDASSNITG TAITTFFSNG LGSLNRSVTQ
ANPDVSAQND PTNIFDCHLS SATGGDYEQA GRLDLVLIGA CQSGATNAGT IFYGTSDKAG
HYSLKEIKED ERGFDYFPPY TADVNGDGTR DVVLIDYQSG PHNSWSNSLD FLENHLNGQW
TLKQVFTESS YAASYLSAVF SGAGADFING GSWEAVAGFT QSPDCCTQDT PGIAILTQDT
SGNYVESQRW TAKAYPYATV AADFNKDGKP DIATVQVNSS ANTASLVLYL NSGAPACPAP
SSPGVHLCSP VAGNTYTSPV NVRATGKAAS GSVVRLELWV DGKKYGNFSG STMNANALLG
SGSHRIVVVE DDSTGGHLNS TPAYITVN