Gene Acid345_1111 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1111 
Symbol 
ID4069226 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1386320 
End bp1387774 
Gene Length1455 bp 
Protein Length484 aa 
Translation table11 
GC content58% 
IMG OID637983120 
Producthypothetical protein 
Protein accessionYP_590188 
Protein GI94968140 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.255858 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000181601 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACGAAGA GTTCCCGCTC CCTGGTTGTG GTTTTCATGT TGTTTGTTTT AGTCCTGCTT 
GCCGGATGCG GTACATCGTC GTCTCCCGAT CCGGCCCCCA CTCCACTGAG CGCAAAGAAC
GTAAATCTTA TTTTCGTGGC CAGTGAAGAC CTGCAACATC ACGGCACGCA GGACATCAAC
GACGACACCG CAAATCTCAC CAGTCAAGGG TTACAGCGGA CACTGCTCCT TGGGACCTAT
CTAAAGCAGA ATGTGCTCGG CGGGAAAGCG GTCACCGCGA TCTACGCCCT GGAGCCGATG
ACCCATCTTC AAACTACGAA CAAGTATCCC GACATGGCGC CGTTGATGGC CGTCCAACAG
TTCGCCATGC TCAACCAGGT CAGCACTTCG ATTAACGGCG GAGCGCCGGT CACTGGGAAC
AGCTTTCCGA TCTTCGCTTC ATACGCTGAC AGCGCAGCAT TACCGAACGA TGTCGCACAG
CCCGTCTTCT CGTGCCCAGG TTGCCAAGGT CTGGACTTCA CCGACCAGAA CGGCGCCAAC
GAGGCACTCG TCGAAGCGCT GATCACGGCC AAGAGCCCCG GATACTTTGT CTTTTCCGCG
CCTTGGGACA CGGTCAGCGC GATGATGTCG AACATCAATG CATCGGAAGG TTTCGGACTT
GCCCTGCCAT CGAGTTATGG CGGCCCCGAC CACGTGTATG CGATCTCAAT TGCGCCTTCA
GGAACCGCGG CCCTTGTCGG TTACAACGCC GACCTTCATC CGGGAACAAG TTACCCCGCA
TTGCCGGCAG GCAAAATCGC GAGCGCACGC TGCCAGGGGA CATACAGCGT AAGTGCCGTG
GGCGGCGCCG GCGGCGCGGT GGTTCCTGCG AACACCAACG TGAATGAAAC GGTGTACATG
ATTCGCCACG CCGAGGCGCA TCCTGCAGCC AACTGGGATG ACGGCAACTA CGTGGCTGCG
GGACAATGGC GCGCGCTCGA CTTGCCGAAC GCGCTGGCCG GAAAGATCCA CCCGGACCAG
GTAATCGCCA TCGATCCTGC AATCGGCATA CCTGGCACGC CCGAGAGCAT CACGTCCTCC
TACATTCGCC CTGCGATGAC GGTAGAGCCA TATGCCATCG CGAACAATCT GCCCTACAAC
CTGGCATCGA GCGTTGCAGT GTTTTCGCAA AACGCGCCGC AATTGGCAAC GAAGGCAAGC
AACTATCTCT TCACCAATGG AACGTTTTCG AACCATGTCT TGCTCGTCGC GTGGGAGCAC
AAGCACGTTC CGCCGACGAT CAATGCTTTG CTGTACTCGT ACGGAGTGGC ACAAAACGCG
CCCTTGTGGA ATGACGAAGA CTACGATTCC ATCTGGACTG TTCGGTTGGA TGCGCAGGGA
AATCTGAGCA TCGATAACCT GGCTTGCGAG GGCATCGATT CCACTTCCCT GCCAGCGACA
GCGCCGCAAT TCTAG
 
Protein sequence
MTKSSRSLVV VFMLFVLVLL AGCGTSSSPD PAPTPLSAKN VNLIFVASED LQHHGTQDIN 
DDTANLTSQG LQRTLLLGTY LKQNVLGGKA VTAIYALEPM THLQTTNKYP DMAPLMAVQQ
FAMLNQVSTS INGGAPVTGN SFPIFASYAD SAALPNDVAQ PVFSCPGCQG LDFTDQNGAN
EALVEALITA KSPGYFVFSA PWDTVSAMMS NINASEGFGL ALPSSYGGPD HVYAISIAPS
GTAALVGYNA DLHPGTSYPA LPAGKIASAR CQGTYSVSAV GGAGGAVVPA NTNVNETVYM
IRHAEAHPAA NWDDGNYVAA GQWRALDLPN ALAGKIHPDQ VIAIDPAIGI PGTPESITSS
YIRPAMTVEP YAIANNLPYN LASSVAVFSQ NAPQLATKAS NYLFTNGTFS NHVLLVAWEH
KHVPPTINAL LYSYGVAQNA PLWNDEDYDS IWTVRLDAQG NLSIDNLACE GIDSTSLPAT
APQF