Gene Francci3_3836 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3836 
Symbol 
ID3905584 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4597379 
End bp4599106 
Gene Length1728 bp 
Protein Length575 aa 
Translation table11 
GC content74% 
IMG OID637881162 
Productleucyl aminopeptidase 
Protein accessionYP_482915 
Protein GI86742515 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0260] Leucyl aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0130732 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCATCG TTGATTCCGC GTCAGGCCCG GCGATCGACG GGCTCATCAG GCTTCCTCCC 
GGCGTTACCC TACGGTCCGC CACCGTCGCC GACGTCGAGC CCTCGACCGT CGCCCTGGTC
CTCGCGTCGT CCGAGAACGG ACCCGTGTTC GACGAAACTG CCCGCGGTGT CGGCGCCGAT
CTCGGACTGG ATCTTGCGCT GCTGGTCGAG TCCGAGTCCC TGCGCGGCGA TGCGGGTTCG
GTGCTGGTGG TGCCGTTGGC CCGGACGGCT CGGCCGACCC GGCTGCTCGT GGTCGGCATC
GGCGCCGGTC AACCGGGTGA CTGGCGGGCC GCGGGCGCCG CGCTCGCCCG CCGCGCCGGG
GCACCGGACC GGCTCGCGGT GGTGGCGGAG CCCGGCGATC CGGGCCTGCG GGCCTTCACC
GAAGGCCTGG CGCTAGGCGC CTACCGGGCC GCTGGAGTGC TCGACCGGGC CGCTGGAGTG
CCCGACCCGG CCGGCCGGCC CGGGCCGGAG AACGGCGCGC CCGGCAACGT CATCGTGCTC
ACCGGTCGGG CGGACGAGCC CGGGGCGGTG GCCGCGGTCG GCGCCGCGCG GGCGGTGGCC
ACCGGGGTGT ACATCGCCCG CGATCTGGTC AACATGCCGA GTCTGGTGAA GTCACCCGAG
TGGCTGGCGA ACCGTGCCGT GCGCATCGCC GCGTCGGCGG GTCTCGACAC GACCCTGCTC
GGCCCCGACG ATCTTTCGGC GCAGGGCTTC GGGGGGTTGT GCGCCGTCGG TGAGGGTTCC
CCGCGGCCGC CCTACCTGGT CAAACTCGAA TATCACGGGC CACCGTGGAC TTCTGGCGAG
GCCGGGTTGG CGGGATCCAC CGGGTCGGCG GGATCCACCG GGTCGGCGGG ATCCACCGGG
TCGGCGGGAT CCATCGGGTC GGCGGGATCC ATCGGGTCGG CGGAGCCGGA TGGTTCGCCG
GCGGGCCGCT TCACCGATGG TCACCGGGTC CTGGTCGGGA AGGGAATCAC CTTCGACTCG
GGTGGGCTCT CCCTGAAGCC GGCCGTCCCG ATGGCCGGCA TGAAGACCGA CATGGCGGGC
GCGGCGGCGG TGCTCGGGGC GATGACCGCG TTGCCGGCGT TGAACGTGCC CGGGCGCGTC
ACCGGCCTGC TCTGTCTGGC GGAGAACATG ATCGGTGCGA CTGCCATGCG TCCCGGCGAC
GTCATCACCT GCTGGGGCGG GACCACGGTG GAGGTACTGA ACACCGACGC CGAGGGCCGC
CTGGTGCTGG CGGACGGCCT CGCCTACGCC GCGGGCGCGC TCGACGCGGA TGTCATCGTC
GATCTCGCCA CGCTGACCGG AGCGATCGCC GTGGCGCTCG GCCGGCGCAC CGCCGGGCTG
TTCAGCTCGG ACGACCGGCT GGCGGCGGCG CTGTCCGCCG CGGCGGACAG CGCCGGGGAA
CGGGTGTGGC GGCTGCCGTT GGTGAAGGAG TACCGGGCGG CGATCGACTC GCCGGTGGCG
GACCTTGCCA ACATCGGCCG GGCGCTGGAC GTCGGGGGCG GTTCCATCAC CGCGGCGCTG
TTCCTGCGGG AGTTCGCGGG CCGGCGGCCC TGGGCACATC TGGACATCGC GGGCACCGCA
CGGTCGGACG CCGACGACGG CGAGATCAGC CGGGGCGGCA CCGGGTGGGG GGTGCGTACC
CTGCTGACCT GGCTGTCGAG TGGGCCATCC CAGACACCGG CGGCCTGA
 
Protein sequence
MSIVDSASGP AIDGLIRLPP GVTLRSATVA DVEPSTVALV LASSENGPVF DETARGVGAD 
LGLDLALLVE SESLRGDAGS VLVVPLARTA RPTRLLVVGI GAGQPGDWRA AGAALARRAG
APDRLAVVAE PGDPGLRAFT EGLALGAYRA AGVLDRAAGV PDPAGRPGPE NGAPGNVIVL
TGRADEPGAV AAVGAARAVA TGVYIARDLV NMPSLVKSPE WLANRAVRIA ASAGLDTTLL
GPDDLSAQGF GGLCAVGEGS PRPPYLVKLE YHGPPWTSGE AGLAGSTGSA GSTGSAGSTG
SAGSIGSAGS IGSAEPDGSP AGRFTDGHRV LVGKGITFDS GGLSLKPAVP MAGMKTDMAG
AAAVLGAMTA LPALNVPGRV TGLLCLAENM IGATAMRPGD VITCWGGTTV EVLNTDAEGR
LVLADGLAYA AGALDADVIV DLATLTGAIA VALGRRTAGL FSSDDRLAAA LSAAADSAGE
RVWRLPLVKE YRAAIDSPVA DLANIGRALD VGGGSITAAL FLREFAGRRP WAHLDIAGTA
RSDADDGEIS RGGTGWGVRT LLTWLSSGPS QTPAA