Gene Acid345_2529 details

Gene Information

Locus tag: Acid345_2529
Symbol:
ID: 4072173
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 2986411
End bp: 2987727
Gene Length: 1317 bp
Protein Length: 438 aa
Translation table: 11
GC content: 60%
IMG OID: 637984546
Product: nodulation efficiency protein NfeD
Protein accession: YP_591604
Protein GI: 94969556
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1030] Membrane-bound serine protease (ClpP class)
TIGRFAM ID:
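
The coordinate and length fields above are consistent with one another. As a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon (the helper names are illustrative, not part of the record), the two lengths can be rechecked like this:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(2986411, 2987727)
print(length)                     # 1317
print(protein_length_aa(length))  # 438
```

Both values match the Gene Length and Protein Length fields of the record.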


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.942851
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.950756
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGACGCC TGTTGCTGCT TTTCGCCGCG CTCTGTATCG CCCTGCCGTC TCTCGCGCAA 
GTCGTCCGCC TCAAGCTCGA CGACACCATT CAGCCGGTCT CCGAGGAATA CATCAGCCGC
GGCATCGATT ACGCCGCAGA GCACAACGCA CAGGCTGTCT TGATCGAGTT GCATACCCCC
GGCGGCCTGG TCACCTCTAC GCGCGCCATC ATCTCGAAGA TCCTTGGCTC CAAGGTTCCC
GTAATCATTT ACGTCTATCC GACAGGCGCG AATGCAGCTT CCGCGGGTTT CTTCATCCTT
GAGTCCGCCG ACATCGCCGC CATGGCGCCC GGCACCAACA CCGGCTCTGC CCATCCCGTC
TCGCTGGCCT TTGGCGTAGC GCAAACAAAG GAAGACGACA CGATGAAGGC CAAGATCGAA
AACGATCTCG CGGCCTTCCT TCGTTCATAC GTCTCCAAGC GCGGACGCAA CGTCGCTCTC
GCCGAAACCG GCGTACGCGA ATCAAAAGCC TTCTCCGACC AGGAAGCCCT CAGCCAGAAC
CTGATTGACG TCATCGCCAA AGACGAGCAG GACCTGCTCT CACAAGTGAA TGGCCGCACC
GTAAAACGTT TCGACGGCAG CACGGTCGTC ATGAAAATCG CCGCTGCTCC CATCACCGAT
TACGACCGCT CGCTCAAAGA ACGCCTCCTC GGCTTCCTCG TCGATCCCAA TATCGCGTTC
CTCATTTTCG CCATCGGCGC CATCGGTCTA TACGCGGAGT TCAATCATCC CGGCGCGATC
ATCCCCGGCG TTGTCGGAAC CGTCTGCATC CTGCTTGCGC TATTCGCGTT CCATTTCCTT
CCGATACGTT ACGCAGCCGT CACGCTCATC CTCGCCTCGT TTATCTTCTT TGCGCTCGAA
GCTAAATTCG CCACCCACGG AATTCTCGGC ATCGCCGGTA TCGCCTGCCT CGCTCTCGGT
GGCATGTTGC TGGTGGATGG TCCCATCCCC GAAATGCGCG TCAAGTGGGA AATGGCGCTC
TCTGTCTCGG TCGCCTTTGG CCTGATCACT GTCTTCCTGA TGACCATCGC CCTCCGCGCG
CGCCGCAATA AGGTGACCAC CGGCATTCAA GGCCTGCTGG GCCAGACCGG CGTCGCCCGT
TCGCCGTTAT CACCAACCGG CAAAGTTACG GTCATGGGTG AGATCTGGGA CGCCGCCTCG
CTGGTACCCG TCGCAGCCGG CGAACCGGTC GTTATACGCG GGATTGACGG GCTTACCCTG
CGCGTCGAGC CCGTCGCACA ATCTGTCGCC GCACGAGAAG TCGTATTACA GCGTTGA
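
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be joined before any computation. The following sketch shows one way to do that and to compute a GC percentage comparable to the GC content field; the function names are hypothetical, and how the database itself rounds the value is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a blocked sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_content_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, as printed in the record.
first_line = "ATGCGACGCC TGTTGCTGCT TTTCGCCGCG CTCTGTATCG CCCTGCCGTC TCTCGCGCAA"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_content_percent(seq), 1))
```

Passing the full gene sequence instead of the first line gives the value to compare against the GC content field.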
 
Protein sequence
MRRLLLLFAA LCIALPSLAQ VVRLKLDDTI QPVSEEYISR GIDYAAEHNA QAVLIELHTP 
GGLVTSTRAI ISKILGSKVP VIIYVYPTGA NAASAGFFIL ESADIAAMAP GTNTGSAHPV
SLAFGVAQTK EDDTMKAKIE NDLAAFLRSY VSKRGRNVAL AETGVRESKA FSDQEALSQN
LIDVIAKDEQ DLLSQVNGRT VKRFDGSTVV MKIAAAPITD YDRSLKERLL GFLVDPNIAF
LIFAIGAIGL YAEFNHPGAI IPGVVGTVCI LLALFAFHFL PIRYAAVTLI LASFIFFALE
AKFATHGILG IAGIACLALG GMLLVDGPIP EMRVKWEMAL SVSVAFGLIT VFLMTIALRA
RRNKVTTGIQ GLLGQTGVAR SPLSPTGKVT VMGEIWDAAS LVPVAAGEPV VIRGIDGLTL
RVEPVAQSVA AREVVLQR
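
The protein sequence can be rederived from the gene sequence with the genetic code named in the Translation table field. Below is a minimal, dependency-free sketch. It assumes the NCBI table 11 codon assignments, which are the same as the standard code for internal codons; alternative start codons are not handled, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G; first base varies slowest),
# paired with the amino-acid string for translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
print(translate("ATGCGACGCC TGTTGCTGCT TTTCGCCGCG"))  # MRRLLLLFAA
```

The output matches the first block of the protein sequence listed above.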