Gene Francci3_4467 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4467 
Symbol 
ID3907443 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp5339193 
End bp5340395 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content72% 
IMG OID637881799 
Productprotein-L-isoaspartate(D-aspartate) O-methyltransferase 
Protein accessionYP_483542 
Protein GI86743142 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG2518] Protein-L-isoaspartate carboxylmethyltransferase 
TIGRFAM ID[TIGR00080] protein-L-isoaspartate(D-aspartate) O-methyltransferase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACCC GCACCCCCGA ACAGCTCCGC GACAACCTGG TCGCCGACAT CCACCGCTGG 
GGCACATTCC GAACCGCCCA GGTCGAAGCC GCGTTCCGCA CGGTCCCGCG ACACCTGTTC
CTGCCCGACG TCGACCTGGA AACCGCCTAC GCCCCCCAGG TCGTCGTCAC CCGCCGCGCC
CCCGACGGCA CCGCGCTGTC CTCAGCATCC CAACCCAGCC TCGTCGCCGC CATGCTCGAA
CAGGCCGGCG TCCACCCCGG CCACCGCGTC CTGGAGATCG GCACCGCCAC CGGCATCAAC
GCAGCACTCC TCGCCGAACT CACCGGCCCG ACCGGCCAGG TCACCACCAT CGAGATCGAC
GAGGAGCTCG CCGCAGGCGC GCGCACCGCA CTGGTCAAGG CCGGTTACGA ACGCGTGGAC
GTTGTCCACG CCGATGGTGC GGCGGGCCAC CCGGGCGGAG CGCCCTACGA TCGGATCGTC
ATCACGGCCG GGGCCTGGGA CCTGGCCAAG GGCTGGTGGA ACCAGCTCGC CCCCGCCGGT
CGTATCGTCG TGCCTCTCCG TCTCCACGGA AGCGGCCTGA CCCGCTCCCT CCCGCTCGAC
GCCGTTGAGC CGGGCCGGCT CGTCAGCCGC TCGGCGCTCG TCTGCGGATT CGTCCCCCTA
CGTGGCGCCG ACGCCCACAC CGGCCGTACC CTCGCCCTCG CAGACGGTGT CGCCCTGCAC
GTCGACGACC ACGACCCCGC CGACGAGCCG GCGCTGCGCG CCGCGGCGGC CAGCCCACCC
CACAACCTAT GGACGGGGCT GACGATCCAC GACGACGAAC CGACCGCGCA CCTCGACCTG
TGGCTCGTCA CCATGGGCGC CCGCTTCGGC CGCCTCGCCG TCGACACCAC CGTCCGCCCT
GACAGCCAGC TCACTCCGAC ACGGCGCTGG GCCGGGGCCA CCATCCACGA CGGCACCACC
ATCGCCTACG TCACCCTGCG TCCCCTCGCA TCCGACACCG ACGAACTCGG CGTCACCGCC
CACGGACCCC ACGCGGCCAC CCTCACCGCG CACCTCACCG ATCTGCTGCA CCAGTGGCGC
AAAGAAGGCC CCGCCGAACC TGTCGTAACG GCCCATGCCG CGGACACCCT GGAGGACCAG
ACCGTCGCCG GACACCGCGT CGATCGGCCG AACAGCCGAC TCACCGTCCG CTGGCAGCCC
TGA
 
Protein sequence
MTTRTPEQLR DNLVADIHRW GTFRTAQVEA AFRTVPRHLF LPDVDLETAY APQVVVTRRA 
PDGTALSSAS QPSLVAAMLE QAGVHPGHRV LEIGTATGIN AALLAELTGP TGQVTTIEID
EELAAGARTA LVKAGYERVD VVHADGAAGH PGGAPYDRIV ITAGAWDLAK GWWNQLAPAG
RIVVPLRLHG SGLTRSLPLD AVEPGRLVSR SALVCGFVPL RGADAHTGRT LALADGVALH
VDDHDPADEP ALRAAAASPP HNLWTGLTIH DDEPTAHLDL WLVTMGARFG RLAVDTTVRP
DSQLTPTRRW AGATIHDGTT IAYVTLRPLA SDTDELGVTA HGPHAATLTA HLTDLLHQWR
KEGPAEPVVT AHAADTLEDQ TVAGHRVDRP NSRLTVRWQP