Gene Acid345_2273 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2273 
Symbol 
ID4073267 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2693675 
End bp2695093 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content60% 
IMG OID637984289 
Productintegrin-like protein 
Protein accessionYP_591348 
Protein GI94969300 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.328252 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGATTGC TTCACATCCT TGCTGCGCTC GTCGTTACGG CCGCGCCGCT CGTCGCCCAG 
GTCACTTACA GTTTCAGCAA TTACCCAACC CAAGGCCCGG CCAATCACAT GGTGATTGCC
GATTTCAACC GCGACGGCTA CCCCGACATG GCGATCGTCG TCTCAAATCC GAATGTGGTG
GACGTGTACT TCAACGACCA CACCGGTCAT TTCTCCAATT ACACCGCTTA TCCCACTGGC
GATTTCGGCT GGGCCCTCGC CCTCGACGCC AATGGTGACG GCTGGCCCGA CATCCTGGTC
GCCTCCACCG GAACGGGCAG TACGACACTG CTCCTGAATA ACGGCGACGG TACCTTCCGC
ACGGGAACCG CGCCCATCAC CAAGGCGCAA GCCAGTCAGT TCGTCGCCGG AGACTTCAAC
AAAGACGGCA AAGTGGATCT CGCCGCCATC GAGGGCAACC AGATCGAAAT CCTGCTGAAC
AACGGCAACG GCACCTTCCA TTCCGGGCAG ATGCTCGCAA TGGCGGGGGG CACCTTCAAC
GCCGTGGTTG CCGATTTCGA TGGTGATGGC AATCTCGATA TCTCGAATGC CGAATCGAAC
AAGTTCCTCG TATGGTGGGG CAAGGGAACA GGAGCCTTCG CAGCGCCGTT GCAAGTACCG
GCGCCAACCA GAGGCAGCCT GTTTTCTGTG GCCACCGCCG ACTTCAATAA CGATGGCCTG
CCCGACCTCG CCGTCAGCAG CAATTACAAT CCGGGCAACT GCGATCCCAC GGGCGGACCC
TGCGGCACCA CCACGGCTCA CATCTACAAG AATATGGGCG GCCGCAGCTT CAGCCACATC
AGCTCTTACC AAATCGGTGA TCGCGAAGGC GGCGTGCTTT CCACCGCGGA TGTTAACGGC
GATCTCAACC AGGACATCGT GGATGTCACC ACCGCAGGCG GCGTTGACAG CGGCGTGTAT
TCCTATCGCC CCGGTAACGG AAACACCACC TTCGGCGCCG AGCAAACCAT CACCGGCGGC
TCTGCGTTTG AGATTGTCCT CCGCGATCTC AATCACGACT CGCGCGCCGA CGTCGGTATC
CCTTCCTTCT TCCCCGGCGG CGAAGGGGAC GTAGGCCTTG CCACCAGCGG CTACAAAACC
TGCACCGGCG TGAGTTCGGC ATCGCTCAAC GCGAAGTTCT GTGAGCCTCA AGGCGATGCG
AATGCCACTC CGTCCTTTTA CGTGATGGCA GGCGGCGATT CTCCCCTCGG CGTGCAGCGC
CTCGAGATTT GGGTAGACGG CAAGAAGATC TACCAGAAGC TCGGCAACCA GTTGTACAAG
AAGATCACGC TGAGCGCGGG CCGGCATCGG CTTGTGGTCG TGGCAGTCGA CAAGTACGTG
GGCACCGCCA GCGCAGCGGA GTACGTGAAC GTGCAGTAG
 
Protein sequence
MRLLHILAAL VVTAAPLVAQ VTYSFSNYPT QGPANHMVIA DFNRDGYPDM AIVVSNPNVV 
DVYFNDHTGH FSNYTAYPTG DFGWALALDA NGDGWPDILV ASTGTGSTTL LLNNGDGTFR
TGTAPITKAQ ASQFVAGDFN KDGKVDLAAI EGNQIEILLN NGNGTFHSGQ MLAMAGGTFN
AVVADFDGDG NLDISNAESN KFLVWWGKGT GAFAAPLQVP APTRGSLFSV ATADFNNDGL
PDLAVSSNYN PGNCDPTGGP CGTTTAHIYK NMGGRSFSHI SSYQIGDREG GVLSTADVNG
DLNQDIVDVT TAGGVDSGVY SYRPGNGNTT FGAEQTITGG SAFEIVLRDL NHDSRADVGI
PSFFPGGEGD VGLATSGYKT CTGVSSASLN AKFCEPQGDA NATPSFYVMA GGDSPLGVQR
LEIWVDGKKI YQKLGNQLYK KITLSAGRHR LVVVAVDKYV GTASAAEYVN VQ