Gene Francci3_3778 details


Gene Information

Locus tag: Francci3_3778
Symbol:
ID: 3906062
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 4527275
End bp: 4528648
Gene Length: 1374 bp
Protein Length: 457 aa
Translation table: 11
GC content: 70%
IMG OID: 637881104
Product: glutamate-1-semialdehyde 2,1-aminomutase
Protein accession: YP_482858
Protein GI: 86742458
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0001] Glutamate-1-semialdehyde aminotransferase
TIGRFAM ID: [TIGR00713] glutamate-1-semialdehyde-2,1-aminomutase
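
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the start and end coordinates are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 4527275
end_bp = 4528648

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # 1374 bp
print(protein_length, "aa")  # 457 aa
```

Under these assumptions the computed values agree with the listed Gene Length and Protein Length.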


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0184634
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCCGGG TCGTTGCGCG GGCCAGGGTC GCGGAGCTGG CCGCGCGCGA GTCCGCCCGC 
CTCGACGTCC GCACCCGGGG CTCGGAGGCG CTGCATGCGC GGGCGGTGCG GTCGATGACC
TCCGGGGTGC CGTCGTCCTA CCAGGTGCGT GATCCCTGGC CGATCTACCT CACCCGCGGT
CTCGGGTCGA AGGTCTGGGA CGTCGACGGC AACGAGTACT CCGACTTCCA CAACGGGTTC
GGTTCGATGG TGCAGGGGCA CGCCCACCCG GCGATCGTGC GGGCCGTGAC CGAGCGGGTG
GCGCTCGGTA CGCACTTCGC GATGCCCACC GAGGACTGCG TGGTGGTCAG CGAGGAGTTG
GCCCGCCGCT TCGGCCTGCC GCAGTGGCGC TATGTCAACT CCGGCTCCGA GGCGACGATG
GACGCGATCC GCATCGCCCG CGGCGTCACC GGCCGCGACA CGATCGTCAA GATCTTCGGT
TCGTACCACG GGCACCACGA CTACGTGATG GTGTCGATCG GGACCCCGTA CGACGACATC
GGTCCGGCCG AGAACATGAA CTCGTTGGGT TACGGTGCCG GGATCCCCCG GGTGGTCGTC
GACCTCACGG TGCCGGTCCC CTTCAACGAC GCTCCGGCGA TGGAGCGGCG GATCGCCGCG
CTCGCCGCCG AGGGACGCCT GCCCGCCTGT GTGATCATGG AGCCGGCGAT GATGAACCTC
GGCGTCGTCC TGCCGGAGCC CGGTTACCTG GCGGCGGTCC GGGAGATCAC CTCCCGGTAC
GGGGTTATCC TGATCTTCGA CGAGGTCAAG ACGGGGCTGT GCGTGGCGGC CGGTGGGGCC
ACCGAGAGGT TCGGCGTGCG CCCGGACCTG GTGACCCTGG CCAAGGCGCT CGGTGGCGGG
CTGCCGTCCG GGGCGATCGG CGCGACGGCG GAACTGATGG AGGCCGTGGC CTCGGACCGG
GTGAAACAGG TCGGCACGTA CAACGGCAAC CCGCTGACCA TGGCCGCCGC CCGGGCGAGC
CTGTTCGAGG TGCTCACTCC CGACGCCTAC ACCCACCTCG ATCGGCTGGG TGGCCGGTTG
ACCGCCGGCT GCGACGAGAT CCTGACCCGG CACGGCATTC CCGGCTACAC CGTCGGCATC
AGCTCGAAGG GATGCGTGCA CTTCACCGAC GCCCCGATCC GTGACTACAC CTCGTTCATG
GCGCACCAGA ACGCCGAGTT GCCCGAACTG GCCTGGCTCT ACAACGCCAA CCGCAACGTC
CTCATGGCGC CCGGGCGCGA GGAGGAGTGG ACGTTGTCGG TGCAGCACAC CGACGCCGAT
GTCGACCGCT ACCTCGACAG CCTCGACCAG ATGGCCCGGG ACCTCGTCGG CTGA
 
Protein sequence
MTRVVARARV AELAARESAR LDVRTRGSEA LHARAVRSMT SGVPSSYQVR DPWPIYLTRG 
LGSKVWDVDG NEYSDFHNGF GSMVQGHAHP AIVRAVTERV ALGTHFAMPT EDCVVVSEEL
ARRFGLPQWR YVNSGSEATM DAIRIARGVT GRDTIVKIFG SYHGHHDYVM VSIGTPYDDI
GPAENMNSLG YGAGIPRVVV DLTVPVPFND APAMERRIAA LAAEGRLPAC VIMEPAMMNL
GVVLPEPGYL AAVREITSRY GVILIFDEVK TGLCVAAGGA TERFGVRPDL VTLAKALGGG
LPSGAIGATA ELMEAVASDR VKQVGTYNGN PLTMAAARAS LFEVLTPDAY THLDRLGGRL
TAGCDEILTR HGIPGYTVGI SSKGCVHFTD APIRDYTSFM AHQNAELPEL AWLYNANRNV
LMAPGREEEW TLSVQHTDAD VDRYLDSLDQ MARDLVG
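
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. The following is a rough sketch under stated assumptions, not the method used to generate this record. It hard-codes the amino acid assignments and start codons of NCBI translation table 11 (the table named in the Gene Information fields), reads an alternative start codon such as GTG as M, and drops a trailing stop codon. The helper names `clean`, `gc_percent`, and `translate_cds` are hypothetical, and codons containing ambiguous bases are not handled.

```python
from itertools import product

# NCBI translation table 11 amino acids, with codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}
# Start codons of table 11; each is read as M at the first position.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq):
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq):
    """Translate a CDS in frame 1, dropping a trailing stop codon if present."""
    seq = clean(seq)
    codons = [seq[i:i + 3] for i in range(0, len(seq) - 2, 3)]
    protein = [CODON_TABLE[c] for c in codons]
    if codons and codons[0] in START_CODONS:
        protein[0] = "M"  # an alternative start such as GTG still encodes M
    if protein and protein[-1] == "*":
        protein.pop()
    return "".join(protein)


# First 30 bases of the gene sequence shown above.
head = "GTGACCCGGG TCGTTGCGCG GGCCAGGGTC"
print(translate_cds(head))  # MTRVVARARV
```

Passing the full gene sequence to `translate_cds` and `gc_percent` gives a way to compare the result against the listed protein sequence and GC content.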