Gene Franean1_3919 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3919 
Symbol 
ID5672280 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4686010 
End bp4687023 
Gene Length1014 bp 
Protein Length337 aa 
Translation table11 
GC content72% 
IMG OID641242798 
Productoligopeptide/dipeptide ABC transporter, ATPase subunit 
Protein accessionYP_001508215 
Protein GI158315707 
COG category[E] Amino acid transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0444] ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0212257 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTACTG ATCCTGTCCT GCAGGTGCGC AATCTGCGCG CCTACATCGG CACCCCGCGT 
GGAGTCGTGC GCGCCGTCGA CGACGTCTCC CTCGACCTCG ACTCCGCCCA GGCCATGGGC
GTGGTCGGGG AGTCCGGGTC CGGCAAGTCG GTGATGGCCC GCGCGATCAT GGGCCTGATG
CCGGCACGGT CCGGGTGCTC CGGCCAGGTG GTGTTCCAGG GCCGGGACCT GCTCACCCTC
CCCCGCAAGC AGCGCGCCGA GATCTGGGGC AAGCAGATCG CCATGGTGTT CCAGGACCCA
GGGCGCTCGC TCAACCCGGT GGTTCGGGTG GAACGGCAGC TCACCGAGGG AATGCGCAAG
CACCTCGGCG TCGGCCGCTC CGAAGCTCGC GGTCGGGCCC TCGACCTTCT GCGGGAGGTC
GGGGTGCCCG ACCCCGAGCG GCGCCTGCGC AACTACCCGC ACGAGCTCTC CGGCGGTATG
CGGCAGCGAG TCATGATCGC TACCGCGCTC GCCTGCGAGC CGACGCTGCT CATCGCCGAC
GAGCCGACCA CAGCGCTGGA CGTGACCGTC CAGCGCCAGA TCCTCGACCT GTTACGCCGG
GTCCAGCGCA ACCACGGAAT GTCCATGATC CTGATCAGCC ACGACCTGGC GGTCGTCGCC
GGCCGCACCG ACCGGGTCGC GGTGATGTAC GCGGGGCGCC TGGCCGAGGC CGGGGCCACC
CGGCAGGTCT TCGAGGCCCC CCGGCACCGC TACACCCACG CGCTGCTCGA GGCGACCCCG
ACCATCGACC ACGAGCGGCA CGCACCGATG CGGCTCATCC AGGGCTCGCT GCCCAACCCC
ATCGACCCGC CCCCCGGATG CCGGTTCGCC GCGCGCTGCG GGCACGTCGA CCCCAAGTGC
GCCGACCCCG GGCCGCAGAT GGTCTCCGTC GGCTCGGATC ACGATGTCGC CTGCATCAGC
CCGGTGCAGG ACGGCGCCGC GGTCGTCCCC GGAGGTGAGC TCGTTGGCCG GTAG
 
Protein sequence
MSTDPVLQVR NLRAYIGTPR GVVRAVDDVS LDLDSAQAMG VVGESGSGKS VMARAIMGLM 
PARSGCSGQV VFQGRDLLTL PRKQRAEIWG KQIAMVFQDP GRSLNPVVRV ERQLTEGMRK
HLGVGRSEAR GRALDLLREV GVPDPERRLR NYPHELSGGM RQRVMIATAL ACEPTLLIAD
EPTTALDVTV QRQILDLLRR VQRNHGMSMI LISHDLAVVA GRTDRVAVMY AGRLAEAGAT
RQVFEAPRHR YTHALLEATP TIDHERHAPM RLIQGSLPNP IDPPPGCRFA ARCGHVDPKC
ADPGPQMVSV GSDHDVACIS PVQDGAAVVP GGELVGR