Gene Acid345_1690 details

Gene Information

Locus tag: Acid345_1690
Symbol:
ID: 4070473
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 2050058
End bp: 2051746
Gene Length: 1689 bp
Protein Length: 562 aa
Translation table: 11
GC content: 58%
IMG OID: 637983698
Product: integrin-like protein
Protein accession: YP_590765
Protein GI: 94968717
COG category:
COG ID:
TIGRFAM ID:
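
The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The Python sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS length includes the terminal stop codon. Neither convention is stated in the record, although both are consistent with the listed values. The function names are hypothetical.

```python
# Values copied from the Gene Information table.
start_bp = 2050058
end_bp = 2051746
gene_length_bp = 1689
protein_length_aa = 562


def span_length(start, end):
    """Length of a closed interval (assumes 1-based inclusive coordinates)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons minus one (assumes the CDS ends with a stop codon)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)
print(computed_gene_length, computed_protein_length)  # 1689 562
```

Under these assumptions, both computed values agree with the Gene Length and Protein Length fields.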


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0343612
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGAACCGGC GCGGGTTCCT GTATGGCATG GGCTTGTTAG GCCTCGCGCA GTCTTTACCA 
ACTTTTGCGC AGACCGCGCC GAATCCCGGG TTCAAACTCG TGGATGTCAC AAAAGCCGCA
GGGATCGACT TTCGACACAA CAACGGCGCG TATGGCGGGA AACTGCTTCC CGAGACACTC
GGTCCGGGTT GCGCCTTCCT CGATTATGAT GGCGACGGTT GGCTCGATGT TCTCCTGGTC
AACGGCACGG ATTGGCCCGC TCATCGTAAA CAGCGGTCGA CTCTCAAGCT CTACCGCAAC
AATCGCAACG GAACTTTCGC TGACGTCACC AAGGCCGCTG GCCTCGATGT CGAGATGTAC
GGCATGGGCG TGGCGGTCGC CGACTACAAC AACGATGGCT TACCCGACAT TTTTGTTACC
TGCGTAGGCC AGAGCAGGCT CTTCAAAAAC ACTGGCAAGG GGACGTTCGT TGACGTCACC
AAGGCCGCCG GTCTCGACAA ACGGACGGCA CTCAGCACCA GTGCTCTCTG GTTCGATTTC
GATCGCGACG GGTTCCTCGA CCTGTTCGTC TGCAACTATG TGAAGTGGTC GCCTGAAACC
GACATCTCTT GCAGCATTGA TGGCAAACAT AAGTCATACT GCACACCCGA AGCGTATCGC
GGCGAAACCT GCTGGCTGCT GCGCAACCGT GGCGATGGAA CCTTTGAGGA CGTCACCGCG
AAGAGCGGGA TATTCGACTC AAGCTCGAAA TCTCTAGGCG TCGCGATGCT CGACAGTAAT
GGCGATGGCT GGCCGGACCT GGTTGTTGCT AACGATACCC AACCCAACAA GCTCTATCGC
AACTTGAAGA ATGGGAAATT TCAGGACGTT GGAGTAGAGA CGGGCCTCGC GTTCAGTATG
GATGGCAAAG CGCGCGCGGG CATGGGCATT GACGTTGGCG ACTTCGAGAA CTCCGGCCGA
CCCGGCATTG CGGTTACGAA CTTCGACAAC GAGATGATCG GACTATACAA GAGCGATGCC
GCCGGCAACT TCAACGATGT CGCAGTGACG GCCGGCGTCG GAGGGCCTTC GCGACGCATG
CTCGGCTTCG GCTGCGCATT CGCAGACTTA AATCTCGATG GAGCGCTGGA TCTGATCGCC
GTCAACGGCC ACATTGATGA CACTGTTCGT GCCGTTCGGC GCGACGTGAA CTTCGAGCAA
GCGCCGCAGA TTTTCCTCAA CGATGGCCAC GGAAAATTCC GCGACATCGC TGGAAGCATC
GGCGCAGACT TCGCCACGCC TAAAGTCGCA CGAGGCCTCG CGTACGGAGA CTTTGACCGC
GACGGCGACC TCGACCTCCT GCTTACGACC AATGCCGGCC CGGCCTACTT GTACCGCAAC
GACCTCAGCA GCGGTAACCG GAGTATTCGC TTCCATCTCA CAGGAACGAA GTCAAATCGC
GATGCAATCG GCGCAACAGT GAAGGTAACG TCCGCTGGCG TTACGCAATC GCGCATTGTG
CATAGCGGTT CGAGCTACCT CTCGCAATCG GAGCTGGCGT TGACGTTCGG GGTGGGCCAG
CGCGACAAAA TCGATCGTGT CGAGATCTCG TGGCCGAGCG GACGCACGGA AGAGTTCAAG
AATCTCGCGA GCGGCAAAGG CTACGCTGCC GTAGAAGCCA AAGGCTGCAA TCCACTCGAT
CATTTCTAA
 
Protein sequence
MNRRGFLYGM GLLGLAQSLP TFAQTAPNPG FKLVDVTKAA GIDFRHNNGA YGGKLLPETL 
GPGCAFLDYD GDGWLDVLLV NGTDWPAHRK QRSTLKLYRN NRNGTFADVT KAAGLDVEMY
GMGVAVADYN NDGLPDIFVT CVGQSRLFKN TGKGTFVDVT KAAGLDKRTA LSTSALWFDF
DRDGFLDLFV CNYVKWSPET DISCSIDGKH KSYCTPEAYR GETCWLLRNR GDGTFEDVTA
KSGIFDSSSK SLGVAMLDSN GDGWPDLVVA NDTQPNKLYR NLKNGKFQDV GVETGLAFSM
DGKARAGMGI DVGDFENSGR PGIAVTNFDN EMIGLYKSDA AGNFNDVAVT AGVGGPSRRM
LGFGCAFADL NLDGALDLIA VNGHIDDTVR AVRRDVNFEQ APQIFLNDGH GKFRDIAGSI
GADFATPKVA RGLAYGDFDR DGDLDLLLTT NAGPAYLYRN DLSSGNRSIR FHLTGTKSNR
DAIGATVKVT SAGVTQSRIV HSGSSYLSQS ELALTFGVGQ RDKIDRVEIS WPSGRTEEFK
NLASGKGYAA VEAKGCNPLD HF
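
The gene sequence begins with TTG, while the protein sequence begins with M. This is consistent with translation table 11, in which some non-ATG codons can act as initiators and are read as methionine at the first position. The Python sketch below illustrates that relationship. It is a minimal example, not the database's own pipeline: the `translate` function is hypothetical, and the set of start codons used (ATG, GTG, TTG) is a deliberate simplification of the full table 11 initiator list.

```python
from itertools import product

# Standard codon-to-amino-acid assignments in NCBI order (bases T, C, A, G);
# translation table 11 uses the same assignments as the standard table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a simplified subset of the table 11 start codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna):
    """Translate a CDS, ignoring whitespace; stop at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            # An initiator codon is read as methionine.
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 18 bases and last 9 bases of the gene sequence above.
print(translate("TTGAACCGGC GCGGGTTC"))  # MNRRGF
print(translate("CATTTCTAA"))            # HF
```

The two fragments reproduce the first six and last two residues of the protein sequence listed above.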