Gene Franean1_1246 details


Gene Information

Locus tag: Franean1_1246
Symbol:
ID: 5669659
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 1499491
End bp: 1501374
Gene Length: 1884 bp
Protein Length: 627 aa
Translation table: 11
GC content: 71%
IMG OID: 641240178
Product: Xaa-Pro aminopeptidase
Protein accession: YP_001505606
Protein GI: 158313098
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0006] Xaa-Pro aminopeptidase
TIGRFAM ID:
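
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the coding sequence ends with a single stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the gene record.
start_bp = 1499491
end_bp = 1501374
gene_length_bp = 1884
protein_length_aa = 627

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and yields no amino acid.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # coordinates match gene length
print(gene_length_bp % 3 == 0)                       # whole number of codons
print(expected_protein_length == protein_length_aa)  # codons minus stop match protein length
```

Under these assumptions all three checks agree with the values listed in the record.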


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.623221
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00858649
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGAAGTC CCGCCATCGG GCCGGCCGTG GTCGCCACGG CGAAGGACGC TACCCGAGCA 
CGGCGGAAGC GGCCTACCGT TAGCGGCATG AGTGCCCAGT CCCACCAAAC ACCAGCTTCA
CGCCCGGTCG CGGGGCGATC CCCCGGCGGT ACGGCCGACG ACGTCGCACA CCGGACCACT
CCCGCTTCGG GCGGCGCGGA AGTGGAACGC GTCGCATCTT CGCCGGACAC CGGCACACCG
GGGCTCCCGG TGCCCCCCGC CGTACCACCG GGGAAATCGA TCCAGCCGGA TCTGGAATCT
CCGGAACCGA CCATCGGGAC AACCCGGGCA CAGGAGCCGG CGGGGGTAAC CATCGCACCG
GGGACTCCCC CCACCGCCGC AGACAGTCCG ACGACGGAGG AACAAGCCGC GGACGAACCG
GGCAGGACCG GACAGGCTAA GCACGACGAG GAACCCCACG CGGCCTTCAG GCGTTTCATG
GCGAGCGGCT GGGCGCCGGT GGATGATATC GTCGCGGTGC GTGACGACTG CGCCCCTTAC
ACCACGAAAC GGCGCTCGCT TCTGGCCACC CGATTCCCCA CCGAAGCGCT TGTCATACCA
AGCGGCGGGC TACACGTCCG GGCGAACGAC ACCGATTACC CGTTCCGCCC CGGCAGTGAC
TTCTTCTGGC TCACCGGATG TCACGAACCG GACGCCGTCC TGATCCTGCA TCCCACCGCT
GCCGGCGACC ATGACGCCGT GCTCTATCTC GCTGACCGAT CCGACCGATC GAGTTCCGCG
TTCTACACGG ACCGCCGTTA CGGCGAACTG TGGGTGGGCC CCCGGCCCGG TGTACGGGAG
ACCACAGCGG CTCTCGACAT CGAATGCCGG CCGCTGCCGG AGCTTCCCGA AGCACTGGCC
CGTCTCGCGC CCGCCAGGAC CCGCGTCGTG CGCGGGCTGG ACGCCCGGGT GGACCGCGCG
GTGAGCCGGT GGTCGCCGAC CGGCTCGTCC GCCGACCGGG ACGCCGCGCT GGCCGAGGTG
CTGTCCGAGC TCCGGCTGGT CAAGGACGAC TTCGAGATCG CCCGGCTGGA CGAGGCGGTC
GCGGCCACCG TGCTCGGTTT CACCGAGTGC GTGGGCGAGC TCGGCCGCGC GGCGACGCTC
CCCAACGGGG AGCGCTGGCT GGAGGGAACC TTCTGGCGAC GGGCCCGCGT CGACGGCAAC
GACGTCGGAT ACGGCTCCAT CGTGGCCTGC GGCCCGCACG CCACGACCCT GCACTGGGTG
CGCGACGACG GCCCAGTGCG GCCCGGTGAC CTGGCACTTC TCGACATGGG GGTCGAGGGC
CGCTCGCTGT ACACCGCCGA TGTGACGCGG ACGCTGCCGG TGAGCGGGCG CTTCAGCCCG
CTGCAGCGCC AGGTTCACGA GGTCGTCTAC CGGGCCCAGC AGGCCGGGAT AGACGCGGTC
CGTCCCGGCG CCGCCTTCCT GGATCCGCAC CGGGCCGCGA TGCGGGTGAT CGCGCAGGCG
CTGCACGACT GGGGATTGCT GCCGGCCACG GTCGAAGAGT CGCTCAGCGA GGACCCGAAA
GCACCCGGGG CCGGCCTGCA CCGGCGCTAC ACACTGCACT CCACGTCGCA CATGCTCGGG
CTGGACGTGC ACGACTGCGC GCAGGCGCGC GACGAGACCT ACCGCGACGC CGCGCTGGAG
GCCGGGATGG TGCTGACGGT CGAACCAGGG CTGTACTTCC AGCCGGACGA CCTCACTGTT
CCACCGGAGC TGCGCGGGAT CGGCGTCCGG ATAGAGGACG ACATCCTGGT CACGCCGGAT
GGAAGTCGGA ACATGTCAGC CGCGCTCGCA CGCTCGGCCG ACGATGTCGA AAAGTGGATG
GCCGGCGAGG CCGCCAGGCA CTGA
 
Protein sequence
MRSPAIGPAV VATAKDATRA RRKRPTVSGM SAQSHQTPAS RPVAGRSPGG TADDVAHRTT 
PASGGAEVER VASSPDTGTP GLPVPPAVPP GKSIQPDLES PEPTIGTTRA QEPAGVTIAP
GTPPTAADSP TTEEQAADEP GRTGQAKHDE EPHAAFRRFM ASGWAPVDDI VAVRDDCAPY
TTKRRSLLAT RFPTEALVIP SGGLHVRAND TDYPFRPGSD FFWLTGCHEP DAVLILHPTA
AGDHDAVLYL ADRSDRSSSA FYTDRRYGEL WVGPRPGVRE TTAALDIECR PLPELPEALA
RLAPARTRVV RGLDARVDRA VSRWSPTGSS ADRDAALAEV LSELRLVKDD FEIARLDEAV
AATVLGFTEC VGELGRAATL PNGERWLEGT FWRRARVDGN DVGYGSIVAC GPHATTLHWV
RDDGPVRPGD LALLDMGVEG RSLYTADVTR TLPVSGRFSP LQRQVHEVVY RAQQAGIDAV
RPGAAFLDPH RAAMRVIAQA LHDWGLLPAT VEESLSEDPK APGAGLHRRY TLHSTSHMLG
LDVHDCAQAR DETYRDAALE AGMVLTVEPG LYFQPDDLTV PPELRGIGVR IEDDILVTPD
GSRNMSAALA RSADDVEKWM AGEAARH
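
The sequences above are printed in space-separated blocks of ten residues. The following is a minimal sketch of how such a listing might be joined into one string and how a GC fraction could be computed from it; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the short input shown is only the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence, used only as a small illustration.
example = "ATGAGAAGTC CCGCCATCGG"
print(len(clean_sequence(example)))
print(gc_fraction(example))
```

Applying `gc_fraction` to the complete gene sequence would give a value to compare against the GC content field reported in the record.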