Gene Acid345_3594 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3594 
Symbol 
ID4072816 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4248873 
End bp4250330 
Gene Length1458 bp 
Protein Length485 aa 
Translation table11 
GC content59% 
IMG OID637985617 
Productintegrin-like protein 
Protein accessionYP_592669 
Protein GI94970621 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.343798 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCGTC TACTGTCCGT CGCTCTGGTT CTACTCATCT GCTTGCCGTT CGCCGGCGCC 
CAGGTAACTT TCAGTAACTC CACCTACAGC GCGCCTTCCA CAACTTCCGG CGTAACCAGC
GGCGATTTCA ATCGCGATGG CGCCCCTGAC ATGGCCGTGC TCAGCGGGCA GAGCGACACC
GACAGCTTCG TCACCGTCTT CCTGGCCACC ACGCCCGGCC ATTTTCCGAG CACCGGCGCC
AACTACCCAA TCCACGCGTT TCCGCAGGAC ATCCGCACCG CCGACATCAA CAACGACGGC
AACCTCGACC TGATCATTTC CTTTAACGCT TCGCCCATCC TCACCATCCT CTACGGACAT
GCGGATGGCA GTTTCACTCC CGGCCCCGAC ATCACTCTCG CGGGAAATGT TCCTGCGCAG
GGCTTTGACG TAGGAGACTT CAACCACGAT GGCAAGATCG ACATCGCCGC AATCGAATGC
GACAGCAGCG ACGTATGCGA CACGCGCGCC CTACTCGGCT CCGGCACGGG CACGTTTACC
ACAAGCTACA AAATTCAAAT GACTGGCGAG GCGCGATCGC TTTCCGCGCG TGACGTGTAC
GGCGACGGCA ATCTCGATCT GATCCTCATC CGCACCAACG AAGTGCTGAT CTTCGGCGGC
GATGGCACCG GCCGCTTCCC CGAGTTCACC CACATCCGTC CGCCGGCGCA TTGCACCGAC
ATCAATGTCT GCAGCGATTC CTTGAACAGT GTCGTGGTCG CCGACTTCAA CAACGACCAG
AAGGTTGACT TCGCCGTGCT CCAGGCGCAC AACTGCGGTG CAGGCTGCGG CTCGAACGAC
GTTTACGTCT ACAAGAACGG CGGCACGTAT CTCTTCACCC AGGTATTCGA CTTGCCTATT
GGCCCGAGTG CCGGCGGGCT CTTGCTTGCC TCCGACCTTA ACGGTGACGG CAACATTGAC
CTGGTAAACG GCAACGGCGA CCACTGGAGG CCCGGGAACA TCTACGCTCA GGGCGCGGGC
AACGGTACAT TCACCGTCCA GAACAACAAC ATCCCGCAAG GCAGCGCGCA GTTGTTCGCG
CGCGATATGA ACCTCGATGC ACGCCACGAT GTGCTCGATA CGATTTGGAT GGACAACGAC
GTCGTTCTTG CGTTGAACAC CTCCGCCTAC ACAAACAACT GTCCTCCGCC TTCATCCGCC
ACAATCGCAG CTAAGATTTG CGCACCGACC AATGGCTCTA CTGTGGCCTC TCCCGTGCTC
ATCAAAGCCA GCGGTAACTC GCCCGCCGGT GTGGTGCGAC TGGAAGTCTG GGTGGACGGC
GTGAAGAAAT ATCAGAAGTG GAACGACCTT ATCGAAAAGA AAATCGCGCT CAGTTCCGGC
AGCCATCGCG TCACGGTGGT GGCGGTCGAT ATGTATGTCG GCACCGCGAA AACCAGCGTG
ACGATCAACG TGCAATGA
 
Protein sequence
MSRLLSVALV LLICLPFAGA QVTFSNSTYS APSTTSGVTS GDFNRDGAPD MAVLSGQSDT 
DSFVTVFLAT TPGHFPSTGA NYPIHAFPQD IRTADINNDG NLDLIISFNA SPILTILYGH
ADGSFTPGPD ITLAGNVPAQ GFDVGDFNHD GKIDIAAIEC DSSDVCDTRA LLGSGTGTFT
TSYKIQMTGE ARSLSARDVY GDGNLDLILI RTNEVLIFGG DGTGRFPEFT HIRPPAHCTD
INVCSDSLNS VVVADFNNDQ KVDFAVLQAH NCGAGCGSND VYVYKNGGTY LFTQVFDLPI
GPSAGGLLLA SDLNGDGNID LVNGNGDHWR PGNIYAQGAG NGTFTVQNNN IPQGSAQLFA
RDMNLDARHD VLDTIWMDND VVLALNTSAY TNNCPPPSSA TIAAKICAPT NGSTVASPVL
IKASGNSPAG VVRLEVWVDG VKKYQKWNDL IEKKIALSSG SHRVTVVAVD MYVGTAKTSV
TINVQ