Gene Acid345_1908 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1908 
Symbol 
ID4069386 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2290109 
End bp2293015 
Gene Length2907 bp 
Protein Length968 aa 
Translation table11 
GC content57% 
IMG OID637983919 
Productfibronectin, type III 
Protein accessionYP_590983 
Protein GI94968935 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0431753 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACGAC ATCATGCAGT TGTTTTCATA CTTCTCTTAA CCCTGGTCAG TTCGTTAGCA 
GCAGTCACAT CGGGTACACC GGCCACCATC ACCAGCCCAA CTCCGGGCAC AAAACTTTCG
AGTACGAGTG CCACGTTCCA GTGGACGACC GGCACGAATG TCACGAGCTA CTCGTTGTAT
GTCGGTACGA CGCGCGGCGC CCACGACATT TACTTCATCA ACACAGGTAC GGGTTTGAGT
GCTTCGGTCA ACAATCTTCC AGCAAATGGC GGGACGTTCT ACGTCGTGTT GAACTCGCTG
ATCGGCGCGG CGTGGCAACA GGTTTCGTAC ACATACATTG CGATGGGCAG TGGCACGGCG
GCTTCAATTA CGGGGCCGAT TTTGGGAGCG AAGCTGACGA CCGGTACGCA GTTGTTCCAG
TGGAGCGGCG GCGCAGGGAT CTCACAGTAC TCGCTGTATA TCGGGAACAC GCGTGGGGCG
CACGATATTG CGTTCATTAA TGCAGGGACG TCAAGCTCGT ACGACGTGAC TGGACTTCCG
ACGGATGGGC GGACGTTCTA TGTGACGCTG TACAGCCTGA ACGGAAGCAC GTGGCTCGCG
AAGAGCTACA GCTACGTCGC CTCGGGAAAT GGTGTGGCGG CTGCGATGAC GAGCCCCGCG
ACGGGTGCGA AGTTGGCGAG CGGCACGCAG TTGTTCCAGT GGAGCGGCGG CGTAGGCATT
TCGCAATACT CGCTGTACAT CGGTACAGTG CGCGGCGGCC ACGACATTGC GTTCGTCAAC
GCGGGGATAG CCAGTTCGTA CAACGTGACA GGCCTACCGA CGAATGGCGA GACGTTTTAC
GTCACGCTAT ACAGCTTGAA CGGGAACACG TGGTTGGCGA AGAGCTATAC GTACTACTCG
AGTGGCGCTG GAACGGCTGC GTATATCACC TCGCCCTCGC CGGGATCGCA GTTCAGCGGG
ACGAGCGCGA CCTTCTCATG GATGGGAGGA GCGGGAATCT CGCAATACAG TTTGTATGTG
GGAACGACAG CTGGCGCGCA CGATATCGCG TTCGTGAATG CAGGATTGAG CACGTCGGCT
AACGTGACTG GACTGCCGAA CGGCGGCCAG ACGATCTACG TGACGCTGAG CTCGCTGAAT
GGGAACACGT GGCTCAGCAA CAAGTACACC TATCAAGCCA GTGGCATAAC GCTGCAGTTC
AACACGAGTA CGAAAGACAT TGCCAGCGGA CTCATCAACT ACGCGTACGC GACGTACTTC
GATGTGAGCG GCGGCAGTCA TCCGTACACG TTTGCGATTG CATCAGGCAG TGCGCCGACA
GGCGTGGTGT TCTCGGGGAC AGCTCCAGGG ATGCTGGCGG GAACTCCCAC TGCCTCCGGA
AACTTCACAT TCACAGTGAA GGTAACGGAT AGCAATAACA ACTCGATTAC CAGCCCGAGC
TTCACGATTC CAATCAGCGC TGGACCGAAC GGCGCGCATA ACTCCTACGT AAATGGACGC
TACATCTGCA CGTATGAGGG ATACGTGGAT AGTGACACCT CGCGGATCGC AACACTGATG
AGCCTCGCGA TTGACGGCGC GGGGCATGTT ACGAGCGGCG TTTATGACTC GAATGGTCGG
AGCACAGGGC TGCTCAATGG AAGCGTGAGC GGGAGCTATA ACCTTGGCGG CGACAACAAC
GGGACGATCA CGCTTTCGAT CGGAAGCAAG ACGCTGAAGT TCGGGATGAT GGGGAACAAT
GTCGGTGGCA GCAGCGTGAG CCAATTCGAC ATCGCACAAA TTGACGATGT TGGATCTGCA
GCTCCCGGCC AGCATGGCGG CGGAGTGTGC AGCAAAGCGA CGACTTCGGC ATTCTCTAAC
ACCACGATGG ATGGAAAGAG CTTCGCGTTC ACACAGCACG GGGAAAACGG AAATGGGATC
CCGCGCGCTC TCGCCGGAAG GTTTACGCTG ACCGCAAGCG GTTCGAACCT GACGATCACC
GGAGGGCAGG CCGACCAAGC GGATGGATCG TCTACTTTGA GGGCGATTCT GTTTGGAGGC
AGCTACACCG AACCCGGTTC CACGGGCCGG TTCACAATCA CTGTCAATCA GACTTCTCCG
ACGACGGACA CCAGCTACGC AGTTGGCTAT GTGATCGATG CAAATCACAT GGCATTTTTG
AATGCTGACA GCGGGAAGGC GCAAGTTGGG GAGATGTACA AACAGCAGCA GGCGTCGTAT
TCGGCGGCGA ACCTGAATTC GAGTTTCGTG ATGCGCGACC TGGAATGGGC ACTCGACGGG
AGCGGCGGGC TGCAATGGAA CAGGGCGCAG ATCATGCAAG GAACAGGTGC CGGCGGTTCA
GGCAGCACCG CCAGCATTAC CATCAACCAG AGCTTCACCA ACGACGCAGA TTCGACGGGG
AGCGAGTACA AGGTCGGAGA CTCGAACGGA ACCGCGACAT TCACGGTTAG CTCGAATGGC
AGAATGGCGA TGAATGACGC GAATCAGGTG ACCGTCGCGT ACTTGTACGA CAACAATTCT
GCATTCGGAG TTAGTGGGGG ACAGAATGTC GGGTCGGGGC TGTATGGAGT CGCGTTCAAT
TACATTGAAC CGCAAATGGC GACAACACCG GGGTCGGGAT CGTATCTCAG CACAGTCGTG
CCCCGGATTG AGCCTGAAGG GAACATCAAC GTCGACCTGG TGACGCTGGG CAGCGGCACG
ATTGCGGGTA TCGGTGATGG CGGAGCCGCG GGGGGCATGG ATTACGCGAG CCCGTTCAGC
GGGACCTTCA CGACGAGCTC GTATGGTGCG TTCTACATAT CTTCGGGTGG CGAGCAAGTC
ACTAGCTGTT TTGTGGTGAG TTCGACGCGC GTGGTTTGCA TTGACGATAC GACCAGGAAC
CCATCGGTAT CAGTGAGCGT GAAGTAG
 
Protein sequence
MRRHHAVVFI LLLTLVSSLA AVTSGTPATI TSPTPGTKLS STSATFQWTT GTNVTSYSLY 
VGTTRGAHDI YFINTGTGLS ASVNNLPANG GTFYVVLNSL IGAAWQQVSY TYIAMGSGTA
ASITGPILGA KLTTGTQLFQ WSGGAGISQY SLYIGNTRGA HDIAFINAGT SSSYDVTGLP
TDGRTFYVTL YSLNGSTWLA KSYSYVASGN GVAAAMTSPA TGAKLASGTQ LFQWSGGVGI
SQYSLYIGTV RGGHDIAFVN AGIASSYNVT GLPTNGETFY VTLYSLNGNT WLAKSYTYYS
SGAGTAAYIT SPSPGSQFSG TSATFSWMGG AGISQYSLYV GTTAGAHDIA FVNAGLSTSA
NVTGLPNGGQ TIYVTLSSLN GNTWLSNKYT YQASGITLQF NTSTKDIASG LINYAYATYF
DVSGGSHPYT FAIASGSAPT GVVFSGTAPG MLAGTPTASG NFTFTVKVTD SNNNSITSPS
FTIPISAGPN GAHNSYVNGR YICTYEGYVD SDTSRIATLM SLAIDGAGHV TSGVYDSNGR
STGLLNGSVS GSYNLGGDNN GTITLSIGSK TLKFGMMGNN VGGSSVSQFD IAQIDDVGSA
APGQHGGGVC SKATTSAFSN TTMDGKSFAF TQHGENGNGI PRALAGRFTL TASGSNLTIT
GGQADQADGS STLRAILFGG SYTEPGSTGR FTITVNQTSP TTDTSYAVGY VIDANHMAFL
NADSGKAQVG EMYKQQQASY SAANLNSSFV MRDLEWALDG SGGLQWNRAQ IMQGTGAGGS
GSTASITINQ SFTNDADSTG SEYKVGDSNG TATFTVSSNG RMAMNDANQV TVAYLYDNNS
AFGVSGGQNV GSGLYGVAFN YIEPQMATTP GSGSYLSTVV PRIEPEGNIN VDLVTLGSGT
IAGIGDGGAA GGMDYASPFS GTFTTSSYGA FYISSGGEQV TSCFVVSSTR VVCIDDTTRN
PSVSVSVK