Gene Francci3_3133 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3133 
Symbol 
ID3903930 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3704067 
End bp3705662 
Gene Length1596 bp 
Protein Length531 aa 
Translation table11 
GC content73% 
IMG OID637880454 
Productleucyl aminopeptidase 
Protein accessionYP_482219 
Protein GI86741819 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0260] Leucyl aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.100377 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACCA TCGTGCCCAG TACCGCCGCT CTCGCCGATC TCGATGTGGA CGCGGTGGTG 
ATCGCCACCG CCACCGGCGA CGAGGGGCTC CTCGTAGCCG GTGGCGCGGC GGACCTGGAC
GCCGCGCTCG GCGGCAGGCT GACGCAGGTG CTTGCCTCGC TGGGCGCGAC CGGCAAGGCC
GGCGAGACGG TCCGCTTCGC CACCCTCGGT ACCGTGCCCT GCGCGACGGT TCTCGCGGTC
GGGCTGGGCC CCCTCGCCAC GTCGTCCACG CCGATGTCGT CCACGCCGAT CGGCACCGAG
GCACTACGCC GGGCGGCCGG GGTCGCGGTG CGCTCCCTCG CCGGGACCGC GCGGGTGGCC
GTCGCTCTGG CGGCCGCGCC GGGCGCGGTC ACGTCGGAGT CGGTGCGTGC CGTGGCCGAA
GGCGCGTTGC TCGGCACCTA CTCCTACGAC GGGCTGCGCA CCACGTCGGC GAACGGCCGC
CCCCGCCCCG TCGAGGAGCT GACGGTGCTC GTCGACGAGC AGAGCCTCCC GACAGCGGAA
GAGGAACTGC GCCGGGCCAC CGTCGTCACG GATGCGGTCA CCCTGGTCCG CGATCTGGTG
AACACGCCGC CGAGCCATCT GTCCCCCGCC CTGCTCGCCG ACATCGCCGT CGAGCGGGCC
ACCGCCGCCG GCGTGACGGT CGAGGTCCTC GACGAGGTGA CGCTCGCCGA GGGCGGCTTC
GGAGGCCTGC TCGGTGTCGG CCAGGGATCC GCGAATCCAC CGCGGCTGGT GCGGCTGGAA
TGGCGCGGCG GTGGATCCGA GCGGCCGGCG CTGGCGCTGG TCGGCAAGGG CATCACCTTC
GATTCCGGGG GGTTGTCCCT GAAGCCGTCG ACCTCCATGG AATGGATGAA GACCGACATG
GCGGGCGCGG CCGCGGTGCT CGGCACCGTC ATCGCCGCGG CCCGGCTCAA GCTCCCGATC
ACTCTCACCG GCTGGATGCC CTGCGCGGAG AACATGCCCT CGGGCGACGC GATCCGCCCC
TCCGACGTTC TCACCCTGCG CGGCGGCACC CGGGTGGAGG TGCTCAACAC CGACGCCGAG
GGCCGGCTGG TGCTCGCCGA CGCCCTCGTC CGCGCCAACG AGGAGTCACC GGACCTGATC
GTGGACGTGG CCACCCTGAC CGGCGCACAG ATCGTCGCCC TCGGCCACCG GACGAGCGGA
CTGATGGGTC GTGCGGCCGC GGTGGACGCG GTCGCGTCGG CGGCGGCCGC CGCCGACGAG
AGCGTCTGGC CGATGCCCCT TCCGCCCGAT CTGCGCAAGG GACTGGACTC GACGGTGGCC
GACATCGCCA ACGTGCCACC GGGAGGCAAC CGAGACGGCG GCATGCTGGT CGCCGCGCAC
TTCCTCGCCT CGTTCGTCCC CGAGAAGGTG TCCTGGGCGC ACATCGACAT CGCCGGTCCC
TCCTGGAACG GTGGAGAGCC CTACGGCCAC ACCCCCAAGG GAGGCACCGG CATGATCGTA
CGCACGCTCG TCCAGCTGGC GCAGGACCGC GCGACGACGA CGGCCTCCCC CGGTTCGGCC
ACCGGTTCCC CGGCTGAGTC CCCGGCCGAG GGCTGA
 
Protein sequence
MTTIVPSTAA LADLDVDAVV IATATGDEGL LVAGGAADLD AALGGRLTQV LASLGATGKA 
GETVRFATLG TVPCATVLAV GLGPLATSST PMSSTPIGTE ALRRAAGVAV RSLAGTARVA
VALAAAPGAV TSESVRAVAE GALLGTYSYD GLRTTSANGR PRPVEELTVL VDEQSLPTAE
EELRRATVVT DAVTLVRDLV NTPPSHLSPA LLADIAVERA TAAGVTVEVL DEVTLAEGGF
GGLLGVGQGS ANPPRLVRLE WRGGGSERPA LALVGKGITF DSGGLSLKPS TSMEWMKTDM
AGAAAVLGTV IAAARLKLPI TLTGWMPCAE NMPSGDAIRP SDVLTLRGGT RVEVLNTDAE
GRLVLADALV RANEESPDLI VDVATLTGAQ IVALGHRTSG LMGRAAAVDA VASAAAAADE
SVWPMPLPPD LRKGLDSTVA DIANVPPGGN RDGGMLVAAH FLASFVPEKV SWAHIDIAGP
SWNGGEPYGH TPKGGTGMIV RTLVQLAQDR ATTTASPGSA TGSPAESPAE G