Gene Acid345_2439 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2439 
Symbol 
ID4072873 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2883702 
End bp2885048 
Gene Length1347 bp 
Protein Length448 aa 
Translation table11 
GC content58% 
IMG OID637984455 
ProductTPR repeat-containing protein 
Protein accessionYP_591514 
Protein GI94969466 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG5010] Flp pilus assembly protein TadD, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0207467 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.980065 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCAGA CCTCAATTAA ATTCGTTTCT CTCCCGTTCC TGTTGTTCGT GCTTATCACC 
GTTTTCACAT TGGCGGAGCC CAAGTCGCCC AACGCGAACA GCGCGGCCGA ACTCAACAAG
CATGGTGAAG AGTTATTGGC GAAGCGCGAT TATCGCGGCG CGGTAAAACA GTTCAAGAAG
GCGCTCACCG TCCAGTCCGA TTACGAACCT GCCGTTCGCA ATCTCGGAAC GGTAATGGAA
GTCCTCGGCA AGGATGCGGA ATCGGAGACC GATCTCCAAA AAGCAATTCG GCTCGCTCCG
GAAGACGCTG TCGCCCACAA TAGTCTCGGC CGCACCCTCT TTCACGAGGG CAAGTACGAG
GATTCAGCGG GCTCCTATCG CAAGGCGATC GAGATTCACG ACGATTACGC CGAGGCCTAC
AACGGACTGG GCGCAGCCTT GCTGAAGCTT GGCAAGACCG ATGAAGCCAT CGGCGCGTTC
CAGTCTGCGG CCTCGAAAGA TCCGAAGAAT GTGGACGCTT TGAGCAATGC CGGCGCCGCG
CTGCTGCACG CGCAGAAAGC GCAGGATGCG CTTCCGTATC TGGAGAAAGC TAAGGCGCTG
AAGCCCGATG CGCCCGACGT CCTGGAGAAT TACGCCAACG CTTTGCAACA ACTCGGCCGC
ACCAACGAAG CGATTACGGA ATATGAGAAG GCGCTCAAGG GCGATCCCAA GAGTGCCGTC
GCGTGGGCGC AGTTAGGACA AACGCAGTAC GCCGCAAAGC AATATCCGGA AGCCGAAGTC
AGCTTTAACA AGAGCCTCCA CCTCGATGCG CATCAACCGG AGGTGCTCTT CCTCCTCGGT
GCGGCCTACA CCGAGCAGGG CAAGTCGAAA GAGGCGATGC ACTCTTACGA GAAGGGCCTT
GCGCTGAAGC CCGACAACCC CGATGGCCTC TACAACCTCG GCCATGCGTA TGAGACGCAG
AAGGAATATC CCAGGGCGAT TGATTCTTAC CAGAAGGCAC TCGCTGCTCG TCCGGAGTTC
ACCCATGCAC TCGCCGGACT CGGCGCTTGC CAACTCGCTT CCAACAAACT CGATGATGCG
ATTGCTACCT ATCGCAAGCT GGTTCCGATG CAATCCGACG ATCCTGGCAT TCGCTTCAAC
TTCGCGACTG CGCTCTTCAA CAAAGGCAAT TTCAAGGAAG CGGCGGAGAA CTATCGCGAG
GCCGTGAAGC TGAAACCGGA CTTCGCACAT GCCCACTACA ACCTCGGCAT GTCGCTTCTG
CGCTTGAATG ACGCGGCGGG CGCCAAGTCG GAGTTTGAAG AAGCGCATCG CCTCGACGCC
AGCCTGCAGA TCCCGAAGAC GAGCTGA
 
Protein sequence
MTQTSIKFVS LPFLLFVLIT VFTLAEPKSP NANSAAELNK HGEELLAKRD YRGAVKQFKK 
ALTVQSDYEP AVRNLGTVME VLGKDAESET DLQKAIRLAP EDAVAHNSLG RTLFHEGKYE
DSAGSYRKAI EIHDDYAEAY NGLGAALLKL GKTDEAIGAF QSAASKDPKN VDALSNAGAA
LLHAQKAQDA LPYLEKAKAL KPDAPDVLEN YANALQQLGR TNEAITEYEK ALKGDPKSAV
AWAQLGQTQY AAKQYPEAEV SFNKSLHLDA HQPEVLFLLG AAYTEQGKSK EAMHSYEKGL
ALKPDNPDGL YNLGHAYETQ KEYPRAIDSY QKALAARPEF THALAGLGAC QLASNKLDDA
IATYRKLVPM QSDDPGIRFN FATALFNKGN FKEAAENYRE AVKLKPDFAH AHYNLGMSLL
RLNDAAGAKS EFEEAHRLDA SLQIPKTS