Gene Franean1_2340 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2340 
Symbol 
ID5670738 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2787487 
End bp2789007 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content71% 
IMG OID641241259 
Productamino acid permease-associated region 
Protein accessionYP_001506680 
Protein GI158314172 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0531] Amino acid transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.512672 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGGGC GTCCGACCGC TATCGGCCGG ACGACAGCCG GCCAAGCCCG CGGGGACAGG 
ACAACCGATG ACGAGACGGC GGTGACCGGG CGAACGGCGC ACGCCGCACA GCCCACCCTG
GCGCGCGTCC TGAGCGGGCG GATGCTGTTG TTCCTCGTGC TCGGTGACGT CCTCGGATCG
GGGATCTACG CGCTCACCGG GCAGATCGCC GCCGACGTCG GCGGGATGAT CTGGGCACCG
TTCCTGATCG CCTTCGCACT GGCGTTCCTC ACCGGGACGG CCTACGCCGA GCTTGTCGGC
AAGTACCCCC GGGCGGCCGG AGCAGCACTC TACGCCAACG AGGCGTTCCG TCGCCCGTTT
CTCACCTTCA TGGTCGGCTT CACCGTGCTG GCCGCCGGGA TCACCTCCAC CGCCACGGCC
GCGCGGGCCA TTGGGGGCAA ATACCTCGCG GAGTTCGTGG CGGCCCCGAC GGTCGTCGTC
GCGGTCGTGT TCATCCTGCT GACCGCGGTC CTCAACGCCC GTGGGATCGA GGAGTCGGCC
CGGGTGAACC TTGTGCTCAC CGTGATCGAG CTGACCGGGC TGTTGCTGAT TGTCGGGATC
GGCGTGCACG CGCTGGCTAC CGGTGCGGGC GAACCGGGAC GGGCCTTCGA GCTGCGTGCC
GAGGGGCCTG CGCCGCTCGC CCTGCTCGCC GGCACGACGC TCGCGTTCTA CGCGATGCTC
GGCTTCGAGA ACTCGGTGAA CCTCGTCGAG GAGGCCCGCC GGCCGCGCCG GGACCTGCCC
CGGGCGCTGT TCGGCGGCAT GTTCATCGCC GGCACGGTCT ACCTGTTCGT CGCGTTCACC
TGCGCGATGC TGGTGGATCC CGACATCCTC ACCGGCTCCG GTGCCCCGCT GCTGGAAGTG
GTGAAGGCCT CCGACGTGCC GGTGCCGTCG AAGCTGTTCG CCGCGGTCGC GCTGGTCGCG
GTCGCTAACA CCGCCCTGAT GAACATGGTG ATGGCCTCCC GTCTCGTCTA CGGCATGGCT
GCCCAGGGCA CGATCCCGAC CGTCTTCGCG CGGGTGCTGC CCGGCCGACG CACCCCGCCG
ATAGCGATCG GCTTCGTGAC GGCCCTGACG GTGGCGCTGG CCAGCACCGG CCAGCTGGCC
GGCCTTGCCG ACACCACCGT CCTGCTCCTG CTGCTGGTCT TCTTCGTCGT CAACATCAGC
GCGCTGGTGC TGCGTCGCGA GCCCGTCCCG CCGGGCGCCT ACCGCGCCCC CACCTGGGCC
CCGGCACTCG GCGCGCTGTC CTGCGCGGCG CTGGCCACCC CGCTGGTCCA CCGCGAGGGC
ACCGTCTACC TGCGCGCCCT CGGCCTGCTC GCGCTCGCCG TCGTCCTCGT CCTTGCCCAG
AAAGTTGTCA GCGGCCGGTT CCGCGCGTGT GAGTTATCTG GTGCCGCCAG ACCGGCCCCG
TCACAGCCCT ACCCGACGGC GACGGGCACA GCCGACTTCT CTGGATCACG GACCTCCTCC
CCGACAGCCT GGCGGGCCTG A
 
Protein sequence
MNGRPTAIGR TTAGQARGDR TTDDETAVTG RTAHAAQPTL ARVLSGRMLL FLVLGDVLGS 
GIYALTGQIA ADVGGMIWAP FLIAFALAFL TGTAYAELVG KYPRAAGAAL YANEAFRRPF
LTFMVGFTVL AAGITSTATA ARAIGGKYLA EFVAAPTVVV AVVFILLTAV LNARGIEESA
RVNLVLTVIE LTGLLLIVGI GVHALATGAG EPGRAFELRA EGPAPLALLA GTTLAFYAML
GFENSVNLVE EARRPRRDLP RALFGGMFIA GTVYLFVAFT CAMLVDPDIL TGSGAPLLEV
VKASDVPVPS KLFAAVALVA VANTALMNMV MASRLVYGMA AQGTIPTVFA RVLPGRRTPP
IAIGFVTALT VALASTGQLA GLADTTVLLL LLVFFVVNIS ALVLRREPVP PGAYRAPTWA
PALGALSCAA LATPLVHREG TVYLRALGLL ALAVVLVLAQ KVVSGRFRAC ELSGAARPAP
SQPYPTATGT ADFSGSRTSS PTAWRA