Gene Franean1_4180 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4180 
Symbol 
ID5672535 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4970670 
End bp4972049 
Gene Length1380 bp 
Protein Length459 aa 
Translation table11 
GC content75% 
IMG OID641243053 
Productamino acid permease-associated region 
Protein accessionYP_001508470 
Protein GI158315962 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0531] Amino acid transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.875127 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.262975 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCCGAG GCATCGACAC CGGTCATGAG AAGCAGGCGG CCGGAGAGCG GACGACCGGC 
GACGAGAGCG GGGTCGACCC CCGGACCGAC CCGCGGGCCG ACACGGGGCC GGTGCTGGCC
CGCACGATCG GCACCGGGGA CGCGGTTGTC ATCGGGCTGG GCGCGATGCT CGGCGCCGGC
GTGTTCGCCG CGTTCGCTCC CGCGGCCCGG GCCGCGGGAG CGGCACTCCT GTTCGCCCTT
GCCACCGCGG CGATCGTGGC CTACTGCAAC GCCACCTCCT CGGCCCGGCT GGCGGCGCTG
TACCCGCAGG CCGGCGGCAC CTACGTCTAC GGCCGGCGGC GTCTCGGCGA CTTCTGGGGC
TACCTCGCCG GCGCCTCGTT CATCACCGGC AAGACCGCGT CGTGCGCCGC GATGGCTCTC
ACCGTCGGCG CCTACACCTG GCCTGGACAT GAGCGTCTGG TGGCGGGCAC CGCCACGGCC
GCCCTCACCG CGGTCAACTA CTCCGGGATG CGGCGCGCGG CCTGGCTGAC CCGTCTGATC
GTCGTCGTCG TCCTGACCGT GCTCGTCGTG GTCGTGGTCG TCTGCGTCAC CAGTGGCGCC
GCGGAGCCGG TCCGCCTCTC CCCCACCGGC CGCGACGGCC CGGCGGGGCC GGCATCGCCC
AGCCTGTGGC CCGGTCTCCC GCAGGCCGCC GGCCTGCTGT TCTTCGCCTT CGCCGGCTAT
GCGCGCATCG CGACGCTCGG CGAGGAGGTC CGTGACCCGC GGCGCACCAT CCGCCGGGCG
ATCCCGACCG CGCTGGGCAT CACGCTGGTC GTCTACGCCG CGGTCGCCGT CGCGTTGCTG
CTGGTGCTAC CGGTGGCCGC GCTCGGGGCG TCAGCCGCGC CGCTCACCGA CGCCGTGCGT
GCCGCCGGCA TGCCGGGTCT GGCTCCGGTG GTGCGGGTCG GGGCGGCCGT CGGTGCGCTC
GGGTCGTTGC TGGCGCTGAT CCTCGGTGTC TCGCGTACCA CCCTGGCGAT GGCCCGGGAC
GGCCATCTGC CGACGCCGCT GGCCGCCGTT CACCCCCGGT TCGGGGTTCC GCATCGCGCC
GAGCTCACCG TCGGGGCCGC CGTCACCGCG CTCGTGCTCA CCACGGACCT GCGCGGCGCG
ATCGGGTTCT CCTCGTTCGG GGTGCTGCTG TACTACGCGG TCGCCAACGC CGCGGCGTGG
ACCCTCTCCC CCACCGAAGG CCGTCCGGTG CGCGCCGTTC CGGTCGTCGG ACTGCTCGGC
TGCCTACTGC TGGCCTTCAG CCTGCCCGGG ACGTCCGTGC TCGGCGGGTC CGCGGTACTC
GCCGCGGCGG CCGTGGTGTA CGCCGCCCGT CACGCGGGCG GGCACCCGGC GCGGCAGTGA
 
Protein sequence
MVRGIDTGHE KQAAGERTTG DESGVDPRTD PRADTGPVLA RTIGTGDAVV IGLGAMLGAG 
VFAAFAPAAR AAGAALLFAL ATAAIVAYCN ATSSARLAAL YPQAGGTYVY GRRRLGDFWG
YLAGASFITG KTASCAAMAL TVGAYTWPGH ERLVAGTATA ALTAVNYSGM RRAAWLTRLI
VVVVLTVLVV VVVVCVTSGA AEPVRLSPTG RDGPAGPASP SLWPGLPQAA GLLFFAFAGY
ARIATLGEEV RDPRRTIRRA IPTALGITLV VYAAVAVALL LVLPVAALGA SAAPLTDAVR
AAGMPGLAPV VRVGAAVGAL GSLLALILGV SRTTLAMARD GHLPTPLAAV HPRFGVPHRA
ELTVGAAVTA LVLTTDLRGA IGFSSFGVLL YYAVANAAAW TLSPTEGRPV RAVPVVGLLG
CLLLAFSLPG TSVLGGSAVL AAAAVVYAAR HAGGHPARQ