Gene Acid345_4408 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4408 
Symbol 
ID4073314 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5235534 
End bp5236577 
Gene Length1044 bp 
Protein Length347 aa 
Translation table11 
GC content60% 
IMG OID637986441 
Producthypothetical protein 
Protein accessionYP_593482 
Protein GI94971434 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTTTTC TTTACGGCCT TTTCCTCGTG TTCGTTGCTT TGCCTCTCGT CGCCCAAACC 
TCCACTGCCG ACATGCTTGT CCGCATCGAG CGCGTGCAAA GCGCCGAGGA CTCTCGCTGT
ATCGTCGTTT ACAGCGACTC CTCCTATCAC GCCGAGCGCA CCCACGGCAG TAAGACCGAT
GTCTTTGAAG GCACACTCGA CTCCGCCCGC TACGCGCCCA TCAAAGATCT CGCCGCCGGC
CCACTCAAAC AGGTCCAGCA CGAGCACATC AGCGCACGCT ACGTGGACTC GCCCGACCTC
CTCTTGATCG ACGTAGACCG CGGTGACGAC ATCCAGCAGA TCCGCTTCGC CAACAAGGCT
GATCGCAAGC CCTATGCCAC GCAACTCGAT CCGGTGCTGC AATGGTTTGA CGCCACGATG
AAGCAGCCCG GCGCGCGCTC GGCCGCGAAA GCCGATCGCT GCGTGCCGGA GAAGAGGACC
ATCACCAAGA TCGTCTCCGG CATTCCAGAG AAACAGCTAC CTACCGGCCC CGAGAATCCG
TATCACATCA AGAACCTGCT GTTGCTGTTT ATCACCGAGC GTTACGAGAG CCGCGGCAAC
TCTGCGGGCG AGAAGAATTG CGTCCTGGTG CGGAATGACG GCTCCATCTA CGCCTACAAA
ATGACTGTGG CCTGGCGCGG AGAGCCAGAC AAGTATCGCG AAATGCACGT CAACGTATCA
CCCGATCAGG TGACGATCCT CAAGCGCGCC ATGGACGATC CCACCCTCGC GAATGCGAAA
GACATCCACC CGGAGAAGCG TTCGTACGCG AGCGAGTCGG ACCGCACCGA GGTGGTCGCC
GTTCGCAATA ACGACGTGCA GGTGCTCCAC TTCGCGCAAC TCGTGAACGC GGAGGAAGTC
GGCGCCACGC CGCGCCAAGG CTACCAACCC GTCAAGGGAA GCGACGGTAG CAAAGAAACC
GCACCCGTTC GCGAGTGGAT CAAGCTGAAC CTAGACAACA GCAAACTCCA GAAGGTAGAC
CACCTCAGCG GAGAGTGTCC ATAG
 
Protein sequence
MRFLYGLFLV FVALPLVAQT STADMLVRIE RVQSAEDSRC IVVYSDSSYH AERTHGSKTD 
VFEGTLDSAR YAPIKDLAAG PLKQVQHEHI SARYVDSPDL LLIDVDRGDD IQQIRFANKA
DRKPYATQLD PVLQWFDATM KQPGARSAAK ADRCVPEKRT ITKIVSGIPE KQLPTGPENP
YHIKNLLLLF ITERYESRGN SAGEKNCVLV RNDGSIYAYK MTVAWRGEPD KYREMHVNVS
PDQVTILKRA MDDPTLANAK DIHPEKRSYA SESDRTEVVA VRNNDVQVLH FAQLVNAEEV
GATPRQGYQP VKGSDGSKET APVREWIKLN LDNSKLQKVD HLSGECP