Gene Francci3_1554 details

Gene Information

Locus tag: Francci3_1554
Symbol:
ID: 3904786
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 1863193
End bp: 1865025
Gene Length: 1833 bp
Protein Length: 610 aa
Translation table: 11
GC content: 67%
IMG OID: 637878891
Product: major facilitator transporter
Protein accession: YP_480659
Protein GI: 86740259
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG2223] Nitrate/nitrite transporter
TIGRFAM ID:
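
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below shows that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(1863193, 1865025)
print(length)                     # 1833
print(protein_length_aa(length))  # 610
```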


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.30485
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACGACTT CCGACACCGC CGCCGGCACC CTGTCGCTTG AGGCCGACGA GGTCCTGGCC 
CGGCCCGGTC CGGACAACTA CAAGTGGATC GCGCTGTCGA ACGCGACCAT TGGCATCCTG
ATGGTGACGA TCAACAGTTC GATTCTTCTC ATCGCCCTTC CGGATATCTT CCGGGGAATC
GAGATAGACC CGCTCGCACC GGAGAACACC TCGTACTTCC TCTGGATCCT GATGGGATTC
CTGCTGGTCA CCTCGGTGCT CGTGGTGAGT CTGGGGCGGG TCGGCGACAT GTTCGGCCGG
GTGCGGATGT ACAACCTCGG CTTCGCGATC TTCACGCTGT TCTCGATCCT GCTCGCCGTC
ACCTGGATGC ATGGCACGGC AGCCGCCTGG TGGATCATCA TCATGCGGGT GCTGCAGGGC
GTGGGCGGTG CCTTCCTCTT CGCCAACTCC AGCGCGATCA TCACTGACGC CTTCCCGGAG
GACGAACGCG GGCTCGCCCT GGGGGTCAAC GGCGTCGCGG CGATCGTCGG ATCATTTCTG
GGGCTGTTGA TCGGCGGTCT GCTGGCGCCG GTCGAATGGC ATCTGGTCTT TCTTGTCTCG
GTGCCGTTCG GCATCTTCGG GACGGTCTGG GCCTACCTGA AGCTGCGCGA CAACGGGGCG
CGCACCCAGG CCCGGATCGA CTGGGCCGGC AACATCACGT TCGCGGTCGG CCTCATCGCG
ATCCTGACCG GCATCGTCTA CGGGCTGCAG CCCTATGGCG GTCACACGAT GGGCTGGACC
AAGCCGTTCG TGCTGAGCTG CCTGTTCGGT GGTCTCGCGG TGCTGATCGG CTTCGTCGTC
ATCGAGCTGC GCTCCGCCGA CCCGATGTTC CGCCTGGACC TGTTCCGGAG CCGGACCTTC
ACCATGGGCA GCATCGCGGC TCTGCTCGGC GCGCTCGCCC GTGGTGGTCT GCAGTTCATG
TTGATTATCT GGTTGCAGGG GATCTGGTTG CCGCTGCATG GCTACAGCTT CGAGAAGACC
CCGCTGTGGG CCGGCATCTA CCTGATTCCG GTGACCGTCG GATTCCTGGT GGCGGGGCCG
CTGGCCGGCC GGTTCGCGGA CCGCTACGGT GCGCGTCCGT TCGCCACCCT GGGACTGGTG
ATCACGGCGG TGGCGTTCCT GCTGTTCGAC GCCATTCCCA TCGACTTCGA CTATCCGTGG
TTCGCGCTGA TCCTGTTGCT GATGGGCCTG TCCATGGGCC TGTTCGCGGG GCCGAACACC
AGCAGCGTGA TGAACACCCT GCCGCCCAAC CAGCGCGGTG CCGGTGCCGG CATGCTCAAC
ACGTTCCAGA ACTCGGCCAG CGTGCTGTCC ATCGGTGTTT TCTTCACCAT CATCGCGCTC
GGGCTGGCCG CCAGCCTTCC GGACGCCATG TACTCCGGGC TTGTCGGGCA GGGCGTCTCC
CCGGCGAAGG CGCACGAGCT GGCGAACCTG CCGCCGATCG GCAGCCTGTT CGCCGCGTTC
CTCGGGTACA ACCCCACCGA GCGACTGCTC GGCCCGGACA CCCTGTCGCA GCTCGACCCG
GCGAAGGCCG ACTTCCTCAC CGGGCACACC TTCTTCCCGA ACCTCATCTC CGGGCCGTTC
GGTGACGGTC TGCGCCTCGC CTTCGCCTTC GCCGCCGTCG CCTGCCTGGT CGCCGCGGGC
TTCTCCTGGC TGCGCGGGAA GCAGCGGCCG CACGTGCGCC GTCCGCTGCT CGAAGAGACG
GCCGAGGGGC TGGCCGGCGC GGGCGACATC GCGGCGATGG AGGACGGTGC CGGGAGCGCT
CTTTCGAGCA GCCCCCTGGC CGCCGAGCGA TAG
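
The GC content reported in the gene information can be recomputed from a sequence printed in this layout. The following is a minimal sketch, assuming the sequence is pasted as text with the space-separated 10-base groups shown above; `gc_fraction` is a hypothetical helper name, and the short fragment in the demo is only the first 10 bases of the gene, so its value is not the whole-gene figure.

```python
def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First 10 bases of the gene sequence: 3 G + 2 C out of 10 bases.
print(gc_fraction("GTGACGACTT"))  # 0.5
```

Passing the full gene sequence to the same function gives the whole-gene fraction, which can be compared with the rounded percentage in the record.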
 
Protein sequence
MTTSDTAAGT LSLEADEVLA RPGPDNYKWI ALSNATIGIL MVTINSSILL IALPDIFRGI 
EIDPLAPENT SYFLWILMGF LLVTSVLVVS LGRVGDMFGR VRMYNLGFAI FTLFSILLAV
TWMHGTAAAW WIIIMRVLQG VGGAFLFANS SAIITDAFPE DERGLALGVN GVAAIVGSFL
GLLIGGLLAP VEWHLVFLVS VPFGIFGTVW AYLKLRDNGA RTQARIDWAG NITFAVGLIA
ILTGIVYGLQ PYGGHTMGWT KPFVLSCLFG GLAVLIGFVV IELRSADPMF RLDLFRSRTF
TMGSIAALLG ALARGGLQFM LIIWLQGIWL PLHGYSFEKT PLWAGIYLIP VTVGFLVAGP
LAGRFADRYG ARPFATLGLV ITAVAFLLFD AIPIDFDYPW FALILLLMGL SMGLFAGPNT
SSVMNTLPPN QRGAGAGMLN TFQNSASVLS IGVFFTIIAL GLAASLPDAM YSGLVGQGVS
PAKAHELANL PPIGSLFAAF LGYNPTERLL GPDTLSQLDP AKADFLTGHT FFPNLISGPF
GDGLRLAFAF AAVACLVAAG FSWLRGKQRP HVRRPLLEET AEGLAGAGDI AAMEDGAGSA
LSSSPLAAER
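
The protein sequence corresponds to a translation of the gene sequence under translation table 11, where an initiation codon such as GTG is read as methionine. The sketch below illustrates this on the first 30 bases of the gene. It is an assumption-laden illustration, not the database's own pipeline: the `translate` function and the `START_CODONS` set (a subset of the initiation codons of table 11) are introduced here for demonstration only.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G);
# table 11 uses the same assignments for internal codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the initiation codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str, first_is_start: bool = True) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and first_is_start and codon in START_CODONS:
            protein.append("M")  # initiation codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("GTGACGACTT CCGACACCGC CGCCGGCACC"))  # MTTSDTAAGT
```

The output matches the first 10 residues of the protein sequence, and the initial GTG is what yields the leading M rather than V.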