Gene Francci3_2436 details


Gene Information

Locus tag: Francci3_2436
Symbol:
ID: 3905048
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 2830347
End bp: 2831405
Gene Length: 1059 bp
Protein Length: 352 aa
Translation table: 11
GC content: 67%
IMG OID: 637879766
Product: protein-L-isoaspartate(D-aspartate) O-methyltransferase
Protein accession: YP_481532
Protein GI: 86741132
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG2518] Protein-L-isoaspartate carboxylmethyltransferase
TIGRFAM ID:
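
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record; it assumes the start and end coordinates are 1-based and inclusive, and that the protein length excludes the stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2830347
end_bp = 2831405
gene_length_bp = 1059
protein_length_aa = 352

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# A CDS of N bp holds N / 3 codons; the final codon is the stop codon.
codons, remainder = divmod(gene_length_bp, 3)

print(span_bp == gene_length_bp)
print(remainder == 0 and codons - 1 == protein_length_aa)
```

Under those assumptions both checks print `True`, so the reported gene length, coordinates, and protein length agree with one another.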


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.326287
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.177296
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGGGC GGGTGGTGAA CCGGGAGGAC TTCATCCCGG ATGAGGTCTG GGTGACCGGT 
GATGACGGAT TCTTCCTGGT TCCGCTCCGT CGAAGCGAAG ATCCGGAAGG GTGGTTGGCG
CTGGTCCGTA GCGATCAGGC GATCACCACC CAGGTAGACG ATGGTAGGGA CAAGTACGAC
GGTAAGGGGA TCATCCCGAC CAGTTCGTGT AGTGCGCGGT GGGTGGTCGA CCGGATGCTT
GATCTCCTCG GCGTGCGCCC GGGAATGCGC GTGCTGGAGA TCGGGACAGG TACCGGCTAC
AACGCTGCGC TTCTCGCGGT GCAGGCCGGT TCCGGTCAGG TGACCAGCAT GGAAGTCGAC
CCGATGATAG CCGGACAGGC ACGGGCGGCG TTGGACCGGA CCGGCCATCC CGTGCGGGTG
ATCGCAGGGG ATGGGACCGC GGGCTATCCG GCGGGCGCAC CGTACGACCG TGTGATCGCA
ACAGCGTCAG TGTCGGTGGT CCCCTGTTCG TGGGTGGAGC AGACCCGGCC CGGGGGGCGG
ATCGTGTTCC CGTTCGCCGG TACCTTCGAC GGGGCGTTGG CGGTTCTGGT CGTCGATGAC
GATGGTGTGG CCCGCGGCCG GTTCCACGAT GAGGCCGGGT TCATGCGGCT ACGGAACCAG
CGGCGCGACC CGCATGTGTG GTGGCTGGGT GAGGACGACG CGGACGTCAG GCCCACCCGC
CGGTATCTCC GCGAGCCTTT CGATGATGCG GCGACCGGGT TCGCGGTCGG CTTGTGGCTG
CCGGGCTGCA CGACCGGGGA CATCGACGAA GGCGGCCCCG CGAACACCCT GTTGCTGTCT
CACAGCCCGT CGCAGTCCTG GGCATCGCTG ACCGCAGGCC TGGACGAGCA CGAGATCACC
CAGTACGGGC CGCGTCGACT CTGGGACGAG CTGGAGACGG CCTACGACTG GTGGATGAAC
TCCGGCCGGC CCTCCCGCGA TCGGTTCGGG CTCACTGTGA CTCCCGACGG GCAAACCTTC
TGGCTCGACA ACCCGGACCA CGCCATCCTT CTCCGCTGA
 
Protein sequence
MSGRVVNRED FIPDEVWVTG DDGFFLVPLR RSEDPEGWLA LVRSDQAITT QVDDGRDKYD 
GKGIIPTSSC SARWVVDRML DLLGVRPGMR VLEIGTGTGY NAALLAVQAG SGQVTSMEVD
PMIAGQARAA LDRTGHPVRV IAGDGTAGYP AGAPYDRVIA TASVSVVPCS WVEQTRPGGR
IVFPFAGTFD GALAVLVVDD DGVARGRFHD EAGFMRLRNQ RRDPHVWWLG EDDADVRPTR
RYLREPFDDA ATGFAVGLWL PGCTTGDIDE GGPANTLLLS HSPSQSWASL TAGLDEHEIT
QYGPRRLWDE LETAYDWWMN SGRPSRDRFG LTVTPDGQTF WLDNPDHAIL LR
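
The gene and protein listings above are printed in 10-character blocks, so they need to be cleaned before any analysis. The sketch below is not part of the original record. It shows one way to strip the spacing, compute GC content, and translate the coding sequence. The helper names (`clean_sequence`, `gc_content`, `translate`) are hypothetical, and the codon table holds the standard amino-acid assignments, which NCBI translation table 11 shares with table 1 (the two differ in permitted start codons, which this sketch does not model).

```python
from itertools import product

# Standard amino-acid assignments (shared by NCBI tables 1 and 11),
# listed in TCAG codon order: TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Drop spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_content(dna):
    """Return the fraction of G and C bases in a DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate in frame from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above, in its 10-base blocks.
prefix = clean_sequence("ATGAGCGGGC GGGTGGTGAA CCGGGAGGAC")
print(translate(prefix))  # MSGRVVNRED
```

The output matches the first ten residues of the protein sequence listed above. Passing the full cleaned gene sequence to `gc_content` and `translate` would allow the reported GC content and protein sequence to be checked in the same way.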