Gene Acid345_2275 details

Gene Information

Locus tag: Acid345_2275
Symbol:
ID: 4073269
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 2696966
End bp: 2698126
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 60%
IMG OID: 637984291
Product: aminotransferase, class V
Protein accession: YP_591350
Protein GI: 94969302
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes
TIGRFAM ID: [TIGR03402] cysteine desulfurase NifS
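
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record: the function names are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the CDS length includes one terminal stop codon (the gene sequence in this record ends in TAG).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length_bp = gene_length(2696966, 2698126)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the computed values agree with the Gene Length (1161 bp) and Protein Length (386 aa) fields.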


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACTCAA TGCAGCGCGT TTACCTCGAC AACAACGCGA CCACGCCGCT CCTCCCCGAA 
GTGCTGGAGG CGATGCAGCC GTATTTCCTC GGACAATTCG GCAATGCGTC GTCCATCCAC
CAGCAAGGCC AGCAAGCGCG GGCCGCGGTG GAGCACGCGC GCGAGCACGT CGCGGATCTT
ATCGGTGCAC GTGAGGCAGA GATCGTTTTC ACCAGCGGCG GCACCGAGGG CGACAACCTC
GCGCTCTTCG GCCTCTGCAA GCCGGGCGAC CACCTGATCA TCAGCACGAT CGAGCACCAT
GCGGTGCTGA ACTCCGCGCA GCGCCTGAAA GAACTCGGCG TAGAAGTCAC AGACGTTCCG
GTAGACGGGC AGGGAATTGT CGATCCTGAC GCGGTAAAGC GTGCGCTCCG CGCAAATACC
AGGCTGATCA GCATCATGCT CGCGAATAAC GAAACCGGTG TTGTACAGAA CGCCGTAGAG
ATTGGGAAAA TCGCCGCCGA AGCCGACGTC TATTTCCACA CCGACGCCGT ACAGGCCATC
GCCAAGATTC CCGTCGACGT AAATGAGATC CGCTGCGACC TGCTGACCCT CGCCGGACAT
AAGATCCACG CACCGCAAGG AACAGGCGCG CTCTACGTGC GCAAGGGTAC GATCCTCGAT
CCACTCTTTT ACGGCGGCCG TCATGAGCGT TCGCGACGTG CGGGGACGGA GAACCTACCG
GGGATTGTCG GACTTGGCAA AGCCGCGGAA CTTGCGATGG CGTGGTTTGA GAATGACGGT
CCAACTCGCA TGGCAGCGCT TCGCGACCGC CTCGAACAAA CGGTCGTCAG CCAACTCGAT
CAACTGACGG TGAACAGTGG CAGCGCCCCC CGAGTGCCGA ACACCACGAA CGTGTCTTTC
GATGGAATTG AAGGCGAAGC TATGGTGATC GCGCTCGATC TGAAGGGCCT TTCTGTCTCC
ACCGGCGCGG CGTGTTCATC GGGCGCAATC GAACCTTCGC ACGTACTGAC CGCCATGGGC
CTTACTCCCG AGCAGGCGCG GGGAAGCATT CGCTTTAGCG TAGGCAAGCA AAATACCGAG
GCTGACATTC AGTTCGCCCT GGAGCGGGTG CCGGAAGTGG TCGCGAAATT GCGCGAGCTG
AGCCCGGTTT ACAGGAAATA G
 
Protein sequence
MNSMQRVYLD NNATTPLLPE VLEAMQPYFL GQFGNASSIH QQGQQARAAV EHAREHVADL 
IGAREAEIVF TSGGTEGDNL ALFGLCKPGD HLIISTIEHH AVLNSAQRLK ELGVEVTDVP
VDGQGIVDPD AVKRALRANT RLISIMLANN ETGVVQNAVE IGKIAAEADV YFHTDAVQAI
AKIPVDVNEI RCDLLTLAGH KIHAPQGTGA LYVRKGTILD PLFYGGRHER SRRAGTENLP
GIVGLGKAAE LAMAWFENDG PTRMAALRDR LEQTVVSQLD QLTVNSGSAP RVPNTTNVSF
DGIEGEAMVI ALDLKGLSVS TGAACSSGAI EPSHVLTAMG LTPEQARGSI RFSVGKQNTE
ADIQFALERV PEVVAKLREL SPVYRK