Gene Francci3_0566 details


Gene Information

Locus tag: Francci3_0566
Symbol:
ID: 3905789
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 656454
End bp: 657662
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 71%
IMG OID: 637877898
Product: aspartate aminotransferase
Protein accession: YP_479679
Protein GI: 86739279
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0436] Aspartate/tyrosine/aromatic aminotransferase
TIGRFAM ID:
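The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 656454
end_bp = 657662
gene_length_bp = 1209
protein_length_aa = 402

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that does not appear in the protein.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length consistent:", expected_protein_aa == protein_length_aa)
```

Under these assumptions the three fields agree with one another: the coordinate span equals the stated gene length, and the codon count minus the stop codon equals the stated protein length.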


Plasmid Coverage information

Num covering plasmid clones: 41
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGGCCA TGAGCAGGAT CTCGCACCGC ATCGGGGGAA TCGCCCCCTC CGCAACGCTC 
GCCGTCGACG CCACCGCCAA GGCGATGCGC GCCGCGGGCC GCGACGTGAT CGGGTTCGGC
GCCGGGGAGC CGGACTTCCC AACCCCGGAC CACATCGTCG CCGCCGCCGA GAAGGCATGC
CGTGAGCCCC GCATGCACCG CTACAGCCCG GCCGCGGGCC TGCCGGAGCT GAAGCAGGCC
ATCGCCGAGA AGACCGCCCG AGACTCGGGG GTGATCGTCT CGCCGAGCCA GGTCCTGGTG
ACCAACGGCG GCAAGCAGGC TGTCTACGCG GCGTTCGCGA CGCTGCTCGA CCCGGGGGAC
GAGGTGCTCC TGCCCGCCCC GTACTGGACG ACCTACCCGG AGTCGATCCG GCTGGCCGGG
GGTGTCCCGG TCGACGTCGT CACCGGTCCG GAAGCCGGGT ACCGGGTGAC GGTGGAACAA
CTGGAGGCGG CACGCACCCC GCGGACCAAG GTTCTGCTGT TCAACTCCCC GTCGAACCCG
ACCGGCGCCG TCCACACCCC CGAGGAGGTC CGCGCGATCG GACGGTGGGC CGAGCAGGCC
GGGATCTGGG TCATCAGCGA CGAGATCTAC GAGCATCTGG TCTACGGCGA GACAAGGTTC
GCCTCCGTTC TCGCCGAGGT ACCTGAGCTC GCCGAACGCT GCATCATCGT CAACGGGGTC
GCGAAGACCT ACGCGATGAC GGGATGGCGG GTCGGTTGGC TGGTCGGGCC GGCGGACGCC
GTAGCCGCCG CGACCAACCT CCAGTCCCAC GCGACGTCGA ACGTCTCGAA CGTCGCCCAG
GCCGCCGCGC TTGCCGCGGT CGCCGGCCCG CTGGATGCGG TCGCCACGAT GCGCACCGCC
TTCGACCGCC GCCGGCAGAC GATGCACCGC CTGCTGTCGC AGACGCCCGG GATCCGCTGC
CCACTGCCCG ACGGGGCGTT CTACTGCTAT CCGTCGGTGC GGGACGTGCT CGGGCGCACC
CTGCGCGGCC GTATCCCCAC CACGTCCGCG GATCTGTGCC AGCTGATCCT CGAGGAGGCT
GGGGTCGCGA TCGTGCCGGG TGAGGCCTTC GGCACTCCCG GCTTCGCCCG GCTGTCCTAC
GCGCTGAGCG ACGACGATCT CGTCAAGGGA GTCAGCCGCC TCTCCGCCCT GCTCGCCGAG
GCGGCCTGA
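
The GC content field reported for this gene can be recomputed from a sequence printed in the grouped layout used above. The following is a minimal sketch, assuming the sequence is pasted as plain text in blocks of ten bases separated by spaces and line breaks. The function name `gc_content` is hypothetical and not part of the source database.

```python
def gc_content(sequence_text: str) -> float:
    """Return the percentage of G and C bases in a whitespace-grouped DNA sequence."""
    # Remove the spaces and line breaks used for display grouping.
    bases = "".join(sequence_text.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Only the first and last lines of the gene sequence above, for illustration.
# Paste the full sequence to compare against the reported GC content field.
example = """
ATGGCGGCCA TGAGCAGGAT CTCGCACCGC ATCGGGGGAA TCGCCCCCTC CGCAACGCTC
GCGGCCTGA
"""
print(round(gc_content(example), 1))
```

The result for the two-line excerpt will generally differ from the value reported for the whole gene, since the percentage depends on every base in the sequence.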
 
Protein sequence
MAAMSRISHR IGGIAPSATL AVDATAKAMR AAGRDVIGFG AGEPDFPTPD HIVAAAEKAC 
REPRMHRYSP AAGLPELKQA IAEKTARDSG VIVSPSQVLV TNGGKQAVYA AFATLLDPGD
EVLLPAPYWT TYPESIRLAG GVPVDVVTGP EAGYRVTVEQ LEAARTPRTK VLLFNSPSNP
TGAVHTPEEV RAIGRWAEQA GIWVISDEIY EHLVYGETRF ASVLAEVPEL AERCIIVNGV
AKTYAMTGWR VGWLVGPADA VAAATNLQSH ATSNVSNVAQ AAALAAVAGP LDAVATMRTA
FDRRRQTMHR LLSQTPGIRC PLPDGAFYCY PSVRDVLGRT LRGRIPTTSA DLCQLILEEA
GVAIVPGEAF GTPGFARLSY ALSDDDLVKG VSRLSALLAE AA